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PREFACE 


This volume assembles 23 of my papers on mathematical logic, 1934-60. 
Twelve of them were published in the Journal of Symbolic Logic, nine 
elsewhere, two nowhere. All are technical. Still, apart from part of paper 
X, the collection is self-contained: points presupposed in one paper are 
covered in others. 

The papers were chosen with a view less to chronology than to present 
interest and utility. Ones whose substance has gone into my books were 
omitted. 

The papers are arranged by subject. The first gives history; the 
second surveys the field. The next three deal with elementary theories 
of integers or, what is in a way the same, of strings of signs. Then come 
seven papers on axiomatic set theory. One of these, from 1936, plays 
variations on Zermelo’s system. The second bases everything on in- 
clusion and abstraction. The third develops a deviant definition of 
ordered pair boasting certain virtues. The next three relate variously 
to the set theories of my ‘New foundations” and Mathematical Logic, 
and the last deals with Frege’s. 

The remaining papers comprise four on truth functions, or Boolean 
algebra, and seven on quantification. One truth-functional topic treated 
is the simplification problem, dear to computer engineers; another is 
completeness. Two of the quantification papers make the proofs of un- 
decidability and the Léwenheim-Skolem theorem easier to follow than 
usual. Others develop streamlined deductive techniques. One paper 
shows how to reduce predicates to a single dyadic one. A final paper 
translates quantification into a notation without variables. 

Twenty-three of my papers, including some of the present lot, were 
privately reprinted by offset in 1963 on Dr. Dagfinn Fgllesdal’s initiative 
and with my hearty approval, for the use of his pupils and mine. In the 
light of that operation I came to see that there was a place for an edited 
and published assortment on a larger scale. For his part in thus sparking 
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the present volume, as well as a more discursive volume that Random 
House is bringing out simultaneously under the title The Ways of Para- 
dox, I am grateful to Dr. Fdllesdal. 

The assembling and editing of the two books occupied the fifth of 
my five good months as Fellow of the Center for Advanced Studies at 
Wesleyan University. I am grateful to the administration of the Center 
for generous support and to the staff for cheerful assistance. 

I thank the ten editors and publishers by whose permission vari- 
ous of my papers are reprinted in this volume. For the particulars see 
the bottom of the first page of each paper. 


wW.Vv.Q. 


Harvard, Mass. 
August, 1966 
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a Whitehead and the Rise 
of Modern Logic 


“Algebra,” for centuries, meant numerical algebra: arithmetic for- 
mulated with help of variables. Complex numbers came as an ex- 
tension, but an extension still of number, arithmetically motivated ; 
not as a departure from number. During the short period from 1843 
to 1847, then, three utterly non-numerical theories came to be 
created in the partial image of numerical algebra: Sir William 
Rowan Hamilton’s theory of quaternions, Hermann Grassmann’s 
theory of extension (Ausdehnungslehre), and George Boole’s alge- 
bra of logic. Mathematicians began to recognize that there was no 
necessary connection between the algebraic type of approach and 
the numerical type of subject matter, and indeed that the algorith- 
mic laws of an algebra might fruitfully be studied in abstraction 
from all subject matter. Benjamin Peirce’s Linear Associative Alge- 
bras (1870), a generalization of Hamilton’s quaternion theory, was 
explicitly a comparative study of certain uninterpreted algebras. 
Thus it was that a wide new field came to be cleared which is today 
undergoing intensive cultivation under the name of abstract alge- 
bra. 


Reprinted from P. A. Schilpp, ed., The Philosophy of Alfred North Whitehead 
(Library of Living Philosophers, 1941) by permission of Professor Schilpp. 
The Library of Living Philosophers is now handled by the Open Court Pub- 
lishing Co., La Salle, Il]. The end of the second paragraph of the paper as here 
presented is new. 
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It was as a pioneer in this field that Whitehead wrote his first 
book, A Treatise on Universal Algebra (1898). In Book I of this 
work he undertook to characterize such algebraic notions as 
equivalence, addition, and multiplication along lines general 
enough to suit any algebra, irrespective of the more special laws of 
the algebra and irrespective of the interpretation. The laws: 


aty=ytea, @tyte=rct+(yt+z2), cyte) = zy + zz, 


e.g., he regarded as common to algebras generally; whereas alge- 
bras might differ in point of various other laws, including: 
sy= yz, (ryje=ayz), rtrd. 

Algebras violating the law ‘x + x = 2’ he called numerical, and 
classified in turn into species substantially as follows. Some algebras 
allow multiplication only between elements belonging to appro- 
priately related mantfolds; and an algebra is of nth species if n 
manifolds have to be thus distinguished. 

The projected second volume of Universal Algebra, which never 
appeared, was to deal with linear algebras, or algebras of first 
species. The existing Volume I is concerned, in the last five of its 
seven Books, with algebras of higher species. This portion of the 
work constitutes a detailed investigation of the calculus of exten- 
sion founded by Grassmann. Book IT of the volume deals with a 
non-numerical algebra, namely the Boolean algebra of logic. This 
Book will concern us to the exclusion of the five succeeding Books. 
But meanwhile there is an aspect of Book I that calls for discussion: 
the general doctrine of equivalence. 

Whitehead regarded ‘=’ as expressing a relation of equivalence, 
short of identity. His defense was that laws such as ‘ey + y = 
y + x’ would otherwise, like ‘z = z’, makes no assertion at all. This 
reasoning, which hints strongly of Wittgenstein’s more recent 
polemic against identity,! loses its force if we attend closely to the 
distinction between notation and subject matter. Let us tentatively 
suppose, contrary to Whitehead as of 1898, that ‘a + y =y+2’ 
does hold as a genuine identity; i.e., that the order of summands is 
wholly immaterial. A notation of addition more suggestive than 
‘zs + y’, then, would consist in simply superimposing ‘z’ and ‘y’ in 
the manner of a monogram. This notation quite properly refrains 
from suggesting any order of summands; and there ceases to be any 
analogue of ‘x + y = y+ 2’, the nearest approach being of the 


1 Paragraph 5-5303. 
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vacuous type ‘z = 2’. Now the one objection to this procedure is 
the expense of casting monograms; and thus it is that we revert to 
a linear notation which imposes an arbitrary notational order on 
summands. The printer foists upon us a redundant notation, is- 
suing in synonyms. One and the same sum can now be expressed 
in two ways, ‘x + y’ and ‘y + 2’. The law ‘xz + y = y + 2’ comes 
to be needed as a means of neutralizing this excess of notation over 
subject matter. The law thus preserves a consequential role, even 
with ‘=’ construed in the strict sense of identity. 

It is not easy to imagine a notation capable of absorbing various 
other laws, e.g., ‘x(y + 2) = xy + xz’, in the manner in which the 
monogram notation absorbed ‘xz + y = y + 2’; nor is it easy to 
imagine a notation capable even of absorbing ‘x + y = y + 2’ and 
yet leaving ‘x + x = 2’ unprejudiced. For most purposes, nota- 
tions teeming with synonyms are forced upon us by circumstances 
yet more compelling than the cost of monograms. Hence the utility 
of the identity concept. 

Continuing his discussion of equivalence (‘=’), Whitehead 
pointed out that an equivalence formula or equation implies re- 
placeability of the one side by the other in all algebraic contexts, 
whereas this does not hold for certain non-algebraic contexts. In 
the differential calculus, e.g., we cannot assert that if 2? = 8 then 
dz? /dx = d8/dx; for dx*/dx is 3x? (hence 12 where x? = 8) whereas 
d8/dzx is 0. This was no doubt further ground for his construing ‘=’ 
as meaning something short of identity; for if x? 7s simply 8, so that 
dx?/dx and d8/dx are d/dz of one and the same thing, then surely 
ae /de = d8/dzx. 

As is now known, however, this apparent anomaly calls for no 
departure from the strict identity relation. It is functions or rela- 
tions, not numbers, that have derivatives. The expression ‘dx'/dz’, 
which may for logical purposes be rendered more clearly in the 
fashion ‘D ),(2z*)’, designates the derivative not of the number 2? 
but of a relation \,(x*)—the relation ‘‘cube of.’’? Likewise ‘d8/dz’ 
or ‘D ),8’ designates the derivative not of the number 8 but of the 
relation A,8—the relation that 8 bears to everything.’ The ‘z’ of 
‘\z(z*)’ and similar contexts is a so-called bound (or apparent) 
variable, a relative pronoun in effect, having no relevance beyond 


2 Cf. §8 below. For the logical definition of the derivative see my Mathe- 
matical Logic, 279. 

3 Incidentally dzx?/dz or Dh, (z°) is itself not a number, 322, but a relation in 
turn, viz. A.(3z?)—“‘triple of square of’’; and d8/dz or DX,8 is not 0 but A,0. 
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the immediate ‘\’ context. Both ‘\,(z*)’ and ‘d,8’ are constants re- 
ferring to fixed relations, and they are quite unaffected by special 
numerical hypotheses such as ‘2? = 8’. 

Whitehead’s version of ‘=’ as equivalence-in-diversity does not 
reappear in his later work. Hinted in the contrast which he drew on 
this early occasion between algebra and the differential calculus, 
still, there is this sound characterization of an algebra: its terms 
are constructed from one another without any binding of variables. 
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The elements of the Boolean algebra are classes. The product xy 
is construed as the class whose members are the common members 
of the class x and y, and the sum z + y is construed as the class 
obtained by pooling the members of x and y.‘ A further notion is 
that of the complement , whose members comprise everything ex- 
cept the members of x. Finally there is the empty class 0, and the 
exhaustive class 1. Equations constructed of these materials admit 
of an algorithm which is rather like that of the familiar numerical 
algebra, but simpler. 

The algebra is subject, as Schréder showed in 1877,° to a neat 
principle of duality: every law remains true when ‘plus’ is switched 
with ‘times’ and ‘0’ with ‘1’. Another important principle is that of 
development, which goes back to Boole: any function f(x) of the 
algebra is expressible as a sum az + b%, wherein the coefficients a 
and b are respectively f(1) and f(0); likewise any function f(z, y) is 
expressible as a sum: 


(1) axy + bty + cry + dZz9, 


wherein the coefficients are respectively f(1, 1), f(0, 1), f(1, 0), and 
f(0, 0); and correspondingly for higher functions f(z, y, z), f(x, y, 
z, w), ete. 

Any equation of the logical algebra can be converted into 
another whose right member is ‘0’; for ‘z = w’ is equivalent to 
‘2 + Zw = 0’. Having thus converted any equation, we may pro- 

‘This version of the sum goes back to Leibniz, 94f. In Boole’s own books 
(1847, 1854) the sum was construed in a slightly different and less convenient 
ee but Leibniz’s version was revived by W. 8S. Jevons (1864), and became 
usual. 


5 Whitehead gives the credit rather to C. 8S. Peirce. For a clarification of the 
point see Alonzo Church’s review of E. T. Bell. 


O_O — 


WHITEHEAD AND MODERN LOGIC 7 


ceed to develop its left member after the manner exemplified in (1). 
Any logical equation thus admits of a standard form wherein a sort 
of polynomial is equated to 0. Since there are such striking analogies 
between equations of logic and those of ordinary algebra, the prob- 
lem of solving logical equations for unknowns was among the first 
to command interest in connection with Boole’s algebra. 

The general equation in one unknown, ‘az + bz = 0’, can be sat- 
isfied just in case ab = 0; and, where this condition is fulfilled, the 
equation proves to be satisfied by dc + bé for every choice of c, and 
by nothing else. The nearest thing to a solution of ‘az + bz = 0’, 
in short, is ‘x = de + bé’. This was known to Boole. The “solution”’ 
—in the same sense—of the general equation in two unknowns was 
given by Schréder in 1890. 

Whitehead sets all these matters forth systematically in Chap- 
ters I and II of his Book II, and derives new results of his own. 
Among these is the solution of the general equation in three or 
more unknowns; also the general solution of n simultaneous equa- 
tions in one unknown. A principle which he establishes in the 
course of these developments is that a function f(z, y,z, ... ),in 
any number of unknowns, is capable of assuming all and only those 
values which can be assumed by this function in one unknown: 


abe... k+ulatb+e+...+h), 


where a, b,c, . . . , kK are the successive coefficients of f according 
to the law of development. These findings are accompanied by 
numerous others in the same vein. 

In his “‘Memoir on the algebra of symbolic logic” of three years 
later, Whitehead delves further into the theory of equations and 
theory of functions of Boolean algebra. He investigates the prob- 
lem of factoring any function into a product of so-called linear 
primes. By a linear prime he means any function f(z, y, z,... ) 
that is expressible in the fashion: 


de+aé+by+bgti+cz+.... 


The importance of this notion is reflected in these principles: (i) a 
function capable of the value 0 is a linear prime if and only if no 
factor of it (other than itself) is capable of the value 0; (ii) no sum 
of distinct linear primes, with the same variables, is capable of the 
value 0. In arriving at various laws of factoring, Whitehead makes 
extensive use of ¢nvariants—in striking analogy to the procedure 
of ordinary theory of equations. His invariants are symmetric func- 
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tions of the coefficients a, b, etc., obtained under the principle of 
development. He shows on the basis of the principle of duality that 
another theory, closely parallel to all this, subsists for what he calls 
separable primes—complements of linear primes. This parallel 
theory is concerned not with factoring but with resolving into 
summands. 

The second part of the article deals with transformations of func- 
tions. Any transformation of x and y into u and v is represented by 
a pair of equations: 


(2) a filu, v), oe falu, v). 


In case the equations (2) happen to be equivalent to another pair 
of the form: 


U= Fy(a, ae Ce F(z, y), 


he calls the transformation a substitution. He shows that the sub- 
stitutions, unlike the transformations generally, constitute a group; 
and that, given any function f, the substitutions that leave f un- 
changed constitute a group by themselves—the zdentzcal group of f, 
as he calls it. Among the results that he establishes with regard to 
substitution groups, perhaps the most notable is this: if the identi- 
cal groups of two functions f; and f, have nothing in common (ex- 
cept the vacuous substitution), then (2) above is itself a substitu- 
tion. 

It was while this paper was at press that Whitehead came upon 
Giuseppe Peano’s powerful mathematical logic; and forthwith he 
embarked with Bertrand Russell on Principia Mathematica. But he 
did not turn his back on Boolean algebra without according it one 
more paper, by way of extending his old theory of equations with 
the help of the new Peano devices. In this paper, ‘“The logic of rela- 
tions, logical substitution groups, and cardinal numbers,”’ White- 
head formulated the general notion of a Boolean equation rigor- 
ously within Peano’s formal notation. The following verbal sketch, 
though diverging somewhat from Whitehead’s meaning, depicts the 
general type of approach. Given an expression of equational form, 
in n variables, we may think of each of its solutions as an n-place 
sequence of classes which, as values of the respective variables, 
make the expression true. Such a sequence may be explained in 
turn as a class of n pairs, wherein the first of the classes is paired 
with the number 1, the second with 2, etc. Finally an equation (as 
distinct from the equational expression) may be conceived as the 


WHITEHEAD AND MODERN LOGIC 9 


class of the solutions. Equations become identified thus with cer- 
tain classes of classes of class-number pairs; and the exact defini- 
tion can be couched in Peano’s symbols. 

An interesting effect of defining equation along such lines was 
that the idea of an equation in infinitely many variables came to 
have meaning. Whitehead then extended various of his earlier re- 
sults to apply to equations in this widened sense. He formulated a 
solution of the general equation, and he determined the number of 
possible solutions. As an instrument in these investigations he used 
Georg Cantor’s theory of infinite numbers—a theory that had it- 
self been the object meanwhile of investigations on Whitehead’s 
part (cf. §9). 


3 


Such, in outline, were Whitehead’s mathematically oriented 
studies of Boolean algebra. But he also directed his attention, even 
in Universal Algebra, to the more strictly logical type of problem: 
that which concerns analysis and systematization of the basic forms 
of discourse. The equations of Boolean algebra provide directly for 
universal statements, to the effect that everything or that nothing 
belongs to a class x (viz., ‘x = 1’ and ‘x = 0’); but they do not 
provide thus directly for existential statements, to the effect that 
something does or does not belong to x. Accordingly Whitehead® 
introduced a curious quasi-term ‘7’, which he thought of as a 
modification of ‘1’ to the following effect: 17, like x1, is x; but the 
use of the notation ‘xj’ is understood as implying incidentally the 
further information that something belongs to z. Analogously he 
introduced another quasi-term ‘w’, thought of as a modification of 
‘0’ to the following effect: x + w, like z + 0, is z, but the use of the 
notation ‘z + w’ implies incidentally that something does not be- 
long to z. When used in isolation, rather than written in an equa- 
tion, ‘rj’ drops its status of term altogether and comes to function 
simply as a statement to the effect that something belongs to x; and 
correspondingly for ‘x + w’. The expression ‘xj’ or ‘x + w’ may 
also function simply as a statement when it stands as a member of 
an equation, provided the adjacent equality sign is modified in the 
fashion ‘=’ and read ‘if and only if’. 

Certain of the laws of Boolean algebra remain valid when ‘7’ and 

¢ Universal Algebra, Bk. II, Chap. 3. 
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‘w occur among the terms, but the analogy is not thoroughgoing. 
Additional rules are formulated to guide the manipulation of ‘7’ and 
‘w’, and even some additional auxiliary notations are adopted. See- 
ing how this engine grates and creaks, we may wonder with Vacca 
why Whitehead did not favor the simple expedient of expressing 
existence statements by means of inequalities: ‘x ~ 0’ for ‘Some- 
thing belongs to x’, and ‘x # 1’ for ‘Something does not belong to 
z’. Perhaps the answer is that Whitehead was viewing algebras 
strictly as systems of equations, as opposed to inequalities. 

The traditional formal logic falls neatly into two parts, which 
may for brevity be called the categorical and the hypothetical. The 
categorical logic embraces the theory of immediate inference and 
the categorical syllogism. The hypothetical logic received only 
fragmentary treatment in the Aristotelian tradition, under the head 
of hypothetical and disjunctive syllogisms; its fuller development 
took place rather at the hands of the Stoic logicians and their fol- 
lowers—notably Chrysippus in ancient times and Petrus Hispanus 
in the Middle Ages.’ Now the algebra of classes deals, in a simple 
and systematic way, with the matters that were dealt with more 
haltingly and inelegantly in the old categorical logic. Whitehead 
accordingly devotes a chapter® to deriving the various moods of 
categorical syllogisms. But, as Boole was aware, the algebra can be 
made also to provide for hypothetical logic, simply by dropping the 
classial interpretation of the algebra in favor of a propositional 
interpretation. To this course Whitehead devotes a chapter in 
turn.® 

Under the propositional interpretation of the algebra, £ is con- 
strued as the denial of the proposition z; and xy is construed as the 
conjunction and x + yas the alternation of the propositions x and y. 
Thus Z is true just in case z is false, ry is true just in case x and y 
are both true, and x + y is true just in case one or both of x and y 
are true. In construing ‘=’ propositionally, there was some diver- 
gence among Boole and his followers. The simplest choice is to con- 
strue ‘x = y’ as true whenever x and y are both true or both false. 
So construed, ‘x = y’ reduces to ‘xy + <7’. The sign ‘=’ becomes 
dispensable, and so do ‘1’ and ‘0’; in fact ‘x = 1’ and ‘x = 0’ reduce 
to ‘x’ and ‘z’. But Whitehead chose rather to construe ‘x = y’ as 
true only in case “any motives . . . to assent, which on presenta- 

7See Lukasiewicz, 1935-1936. 


8 Universal Algebra, Bk. II, Chap. 4. 
* Ibid., Chap. 5. 
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tion to the mind induce assent to z, also necessarily induce assent 
to y and conversely.’”° For him, ‘z = 1’ does not reduce to ‘2’; it 
affirms rather that z is not merely true, but self-evident. Corre- 
spondingly ‘x = 0’ affirms that x is “‘self-condemned.” Again ‘zj’ 
and ‘zs + w’ (which might have been ‘x ~ 0’ and ‘x ¥ 1’) deny 
respectively that x is self-condemned and that z is self-evident. Sub- 
stantially a logic of modalities—necessity, impossibility, possibility, 
contingency, equivalence—is thus forthcoming. 

Let us take stock again of the subject matter of the algebra. 
Originally, under the classial interpretation, the elements z, y, etc., 
of the algebra were said to be classes; i.e., the letters ‘x’, ‘y’, etc., 
were thought of as replaceable by any actual names of specific 
classes. Now, under the propositional interpretation, the elements 
Z, y, etc., are said to be propositions; and this means, correspond- 
ingly, that ‘x’, ‘y’, etc., are to be replaceable by names of proposi- 
tions. Actually, in getting specific instances of the algebraic laws 
under the so-called propositional interpretation, the letters are 
ordinarily replaced by statemenis; so we seem called upon to under- 
stand ‘‘propositions’” no longer as statements, as was traditionally 
done, but rather as certain other entities—abstract and non-lin- 
guistic, presumably—whereof statements as wholes are names. A 
compound of the form ‘x = y’, then, is to be true when the state- 
ments joined by ‘=’ are names of the same proposition; and per- 
haps Whitehead’s vague criterion in terms of ‘‘motives to assent” 
is as reasonable a criterion as any for deciding whether two state- 
ments are names of the same proposition. It does seem more 
natural than the simpler alternative previously considered, 
whereby ‘x = y’ holds whenever both components are true or both 
false. 

By rephrasing the so-called propositional interpretation of the 
algebra, though, we can easily dispense with obscure entities and 
reinstate the common-sense view, according to which statements 
are not names at all. We may stipulate simply that the constant 
expressions substitutable for ‘x’, ‘y’, etc., are to be statements; we 
need not countenance any entities x, y, etc., whereof the substituted 
statements are names. Whereas ‘=’ in its ordinary sense of ‘is’ or 
‘is equivalent to’ is a verb, a connective of names, we must now 
reconstrue it in a fashion appropriate for connecting statements; 
and the obvious reading is ‘if and only if’. Just as ‘ry’ and ‘x + y’ 


10 Tbid., 108. 
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are read ‘x and y’ and ‘z or y’, so ‘x = y’ is read ‘zx if and only if y’; 
verbs appear only inside the component statements that are 
thought of as supplanting ‘z’ and ‘y’. And now it 7s quite natural to 
explain a compound of the form ‘rz = y’ as true whenever both its 
components are true or both false. 

Modalities, e.g., logical necessity, logical impossibility, logical 
equivalence, are most clearly construed as properties and relations 
of statements, and treated in a theory that discourses about state- 
ments. In such a theory, known nowadays as metalogic, variables 
‘zy’, ‘y’, etc., occur which refer to statements and thus admit not 
statements but names of statements as substituends. Thus the 
modalities are lifted out of logic proper, and reserved to a discipline 
that treats of the expressions used in that logic. Whitehead’s own 
subsequent work conformed to this trend; the modalities do not 
recur in his definitive formalization of logic." 

In Universal Algebra, however, Whitehead had a special use for 
his modal version of the algebra of logic; viz., in dealing with cate- 
gorical statements by MacColl’s method. The method consists in 
rendering ‘Every A is a B’, ‘No A is a B’, etc., in the respective 
forms ‘x7 = 0’, ‘ry = 0’, etc., where ‘x’ and ‘y’ stand for ‘It is 
an A’ and ‘It isa B’. The apparent advantage of this method is that 
it renders the propositional interpretation of the algebra adequate 
not only to the hypothetical logic but to the categorical logic as 
well; the shift to the classial interpretation for purposes of categori- 
cal logic becomes unnecessary. Thus it was that Whitehead con- 
ceived of the propositional interpretation as having “‘perhaps the 
best right to be called a system of Symbolic Logic.’’” 

But the method has shortcomings. First, it requires us to extend 
the notion of statement or proposition to include such expressions 
as ‘It is mortal’, where ‘it’ is thought of as having no grammatical 
antecedent, expressed or implied. This extension is drastic, for 
clearly such expressions admit neither of truth nor of falsehood. 
Second, the method is not very clearly suited to Whitehead’s ver- 
sions of ‘=0’ and ‘= 1’. There is an excess of rationalism in the view 
that ‘Every vertebrate has a heart’ expresses impossibility or “self- 
condemnedness”’ of ‘It is a vertebrate and it has no heart’. Third, 
the method cannot be extended to more complex cases. If, e.g., we 
construe: - 


11 For further discussion of these matters see §4 below; also §§4-5 of my 
Mathematical Logic. 
12 Universal Algebra, 111. 
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(3) Whatever interests every man interests Socrates 


as: 
(4) (it interests every man and it does not interest Socrates) = 0, 


and then we construe the fragment ‘it interests every man’ in (4) 
correspondingly as: 


(5) (it is a man and it does not interest it) = 0, 


we find ourselves involved in ambiguity; for the final result is in- 
distinguishable from the result obtained by starting not with (3) 
but with: 


(6) Whatever every man interests interests Socrates. 


Inadequacy to the analysis of complex cases like (3) is a limita- 
tion that is shared by the classial interpretation of the Boolean alge- 
bra, and by the traditional formal logic as well; and that it obtrudes 
itself particularly as a shortcoming of MacColl’s device is perhaps a 
virtue of the latter. Actually Frege had overcome this limitation 
during the very period of MacColl’s papers, and nineteen years 
before the appearance of Universal Algebra; for it was in 1879 that 
Frege created quantification theory and thereby founded modern 
mathematical logic.'® But Frege’s books remained long unnoticed. 
Only during the five years preceding Universal Algebra had Peano 
come to make extensive use of techniques similar to Frege’s; and 
not until two years after Universal Algebra did Whitehead come to 
know Peano’s work. Frege remained unknown to Whitehead for yet 
another two years. 


4 


Peano’s logic, together with certain emendations by Russell, 
came to Whitehead’s enthusiastic attention in 1900. “I believe,” 
Whitehead wrote, ‘that the invention of the Peano and Russell 
symbolism . . . forms an epoch in mathematical reasoning.’ In 
particular, he deemed it virtually indispensable as a tool for explor- 
ing Cantor’s new theory of infinite cardinal numbers. 


The abstract nature of the [latter] subject makes ordinary language 
totally ineffective, only gaining precision by verbosity, and imagination 


13 Quantification theory will be taken up in §5. 
141902, 367. 
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is very misleading, since it presents to us special aggregates which are 
denumerable or of the power of the continuum. Thus we are thrown back 
onto a strict logical deduction by the symbolic method." 


Russell, in 1900, was working on the first volume of his Principles 
of Mathematics. Here he undertook to analyze the logical founda- 
tions of arithmetic and other parts of mathematics within the 
medium of ordinary language, without explicit recourse to Peano’s 
symbolism. This volume, ‘‘which may be regarded either as a com- 
mentary upon, or as an introduction to, the second volume, is 
addressed in equal measure to the philosopher and to the mathe- 
matician.’’® The projected second volume, on the other hand, was 
to spare no rigor and no detail. Here the reader was to be exposed 
to the full blast of Peano’s symbols. To quote Russell: 


The second volume, in which I have had the great good fortune to se- 
cure the collaboration of Mr. A. N. Whitehead, will be addressed to 
mathematicians; it will contain chains of deductions, from the premises 
of symbolic logic through Arithmetic, finite and infinite, to Geometry, 
in an order similar to that adopted in the present volume; it will also 
contain various original developments, in which the method of Professor 
Peano, as supplemented by the Logic of Relations, has shown itself a 
powerful instrument of mathematical investigation. 


Such was the collaboration on which Whitehead and Russell em- 
barked in 1900, with little thought that they were to be engaged in 
it for eleven years. But, they recounted afterward, 


as we advanced it became increasingly evident that the subject is a very 
much larger one than we had supposed; moreover, on many fundamental 
questions which had been left obscure and doubtful in the former work, 
we have now arrived at what we believe to be satisfactory solutions. It 
therefore became necessary to make our book independent of The Prin- 
ciples of Mathematics.'® 


So Volume I of the Principles remained, like that of Universal 
Algebra, without a mate. Instead of Volume II there appeared, in 
1910-1913, one of the great intellectual monuments of all time: the 
three volumes of Principia Mathematica. Even this vast work stops 
short of the geometry that had been slated as part of Volume II of 
Principles; this material was deferred still to a projected fourth 
volume of Principia.” 


18 Principles of Mathematics, Preface. 
16 Principia Mathematica, Vol. 1, v. 
17 Tbid., Vol. 3, v—vi. 
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Part I of Principia, occupying the first half of the first volume, 
is entitled ‘““Mathematical Logic.” It begins with what is sub- 
stantially Boolean algebra in its simplest propositional interpreta- 
tion; but without ‘=’, ‘0’, and ‘1’, for we have seen ($3) that these 
are superfluous under this interpretation. The letters ‘p’, ‘g’, and 
others are used as propositional variables; and the Boolean opera- 
tions of negation, addition, and multiplication—i.e., in the present 
interpretation, denial, alternation, and conjunction—are expressed 
in the fashion ‘~p’, ‘p V q’, ‘p.q’. Only the first two of these are 
taken as primitive, or undefined; the third is defined in terms of the 
other two as ‘~(~p V ~gq)’. Further defined compounds are 
‘pn > ¢@ and ‘p = q’; these are defined respectively as‘~p V q’ and 
‘(p D g)- (q> p)’, and are read ‘p implies q’ and ‘p is equivalent 
to q’. 

The laws: 


eM, Gov®g, @PvaodGy op), 
(CT ie Vag) Se vr) 


are adopted as formal axioms, together with a fifth that has since 
been proved redundant;!® and a couple of hundred further laws, 
hitherto recorded for the most part by Boole’s early followers C. S. 
Peirce and Ernst Schréder, are deduced from these axioms as 
theorems. The deductive procedures used are two: substitution of 
appropriate expressions for variables, and modus ponens (i.e., can- 
cellation of the first part of a theorem or axiom of the form ‘p +> q’ 
when that first part is itself a theorem or axiom). Such rules of 
deductive procedure call for metalogical formulation (cf. §3), as 
rules about expressions of the sort that occur as axioms and theo- 
rems; however, the rule of modus ponens shows its metalogical 
status less clearly in Principia than it had in Frege’s work, and the 
rule of substitution is left tacit altogether.” 

Whitehead and Russell view definitions as conventions of ab- 
breviation; thus ‘p . q’, “p D q’, and “p = q’ are mere shorthand for 
the more official renderings ‘~(~p V ~q)’, ‘~p V q’, and 
‘(“~p V q).(~¢@ V p)’. Now over the legitimacy of such conven- 
tions there can be no dispute; but dispute may still arise over the 


18 My discussion of Principia will relate solely to the first edition; for White- 
head had no hand in the Introduction and Appendices that were added to Vol. 
1 in the second edition. Cf. Mind 35 (1926), 130. 

19 Cf. Bernays, 1926. 

20 See Frege, 1893, 25; Russell, 1919, 151. 
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attempt to paraphrase such a defined sign in words—e.g., ‘.’ as 
‘and’, ‘’ as ‘implies’, ‘=’ as ‘is equivalent to’. Against reading 
‘>’ as ‘implies’, objections have indeed been raised.” It is objected 
that every false proposition then comes to imply every proposition, 
and that every true proposition comes to be implied by every 
proposition, in glaring violation of any ordinary idea of implication. 
Exception has been taken similarly to the proposed reading of ‘=’. 

This controversy would not have arisen if the notion of state- 
ments as naming had been carefully avoided, and the variables ‘p’, 
‘q’, etc., had been treated explicitly as standing in positions appro- 
priate to statements rather than names (cf. §3). The contested read- 
ings of ‘=’ and ‘)’ would then never have suggested themselves. 
The natural readings are ‘if and only if’ and ‘only if’, which, like 
‘and’ and ‘or’, are the sort of expressions that appropriately join 
statements to form statements. ‘=’ is the biconditional sign, and 
‘>’ the conditional sign. Implication and equivalence, on the other 
hand, are relations between statements, and hence are properly 
attributed by putting a verb ‘implies’ or ‘is equivalent to’ between 
names of the statements related. Such discourse belongs to meta- 
logic. 

In Principia, as in Frege’s logic, one statement is capable of con- 
taining other statements truth-functionally only; i.e., in such a way 
that the truth value (truth or falsehood) of the whole remains un- 
changed when a true part is replaced by any other truth, or a false 
part by any other falsehood. Preservation of this principle of truth- 
functionality is essential to the simplicity and convenience of logical 
theory. In all departures from this norm that have to my knowledge 
ever been propounded, moreover, a sacrifice is made not only with 
regard to simplicity and convenience, but with regard even to the 
admissibility of a certain common-sense mode of inference: infer- 
ence by interchanging terms that designate the same object.” On 
grounds of technical expediency and on common-sense grounds as 
well, thus, there is a strong case for the principle of truth-func- 


21 Notably by Lewis, Chaps. 4-5. 

22 Lewis and Langford, e.g., use a non-truth-functional operator ‘ ¢’ to ex- 
press logical possibility. Thus the statements: 

¢ (number of planets in solar system < 7), 
0 (9 <7) 

would be judged as true and false respectively, despite the fact that they are 
interconvertible by interchanging the terms ‘9’ and ‘number of planets in 
solar system’, both of which designate the same object. Similar examples are 
readily devised for the early Whitehead system discussed in §3. 
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tionality. But it is a mistake to suppose that in order to preserve 
this principle we must construe the relations of implication and 
equivalence truth-functionally. It would be as reasonable to sup- 
pose that we must so construe the relation of rhyming as to make it 
depend on the truth values of the rhyming statements. The prin- 
ciple of truth-functionality concerns only the constructing of state- 
ments from statements; whereas relationships such as implication 
or equivalence or rhyming are properly ascribed rather by attaching 
verbs to names of statements. It is regrettable that Frege’s own 
scrupulous observance of this distinction between an expression and 
its name, between use and mention, was so little heeded by White- 
head, Russell, and their critics. 


4) 


The theory of quantification occupies the next portion of Part I of 
Principia. Universal quantification consists in attaching the so- 
called universal quantifier ‘(a)’ [or ‘(y)’, etc.], which may be read 
‘whatever object x [or y, etc.] may be’, to an expression that has the 
form of a statement but exhibits recurrences of the variable ‘z’ 
[or ‘y’, etc.]. Thus (8), above, can be rendered as: 


i) (x)(x interests every man _+ z interests Socrates). 
The part ‘x interests every man’ here can be rendered in turn as: 


(y)(y is a man _- z= interests y), 
so the whole becomes: 


(8) (z)((y)(y is a man > =z interests y) D x interests Socrates), 


whereas (6) becomes rather: 
(9) (x)((y)(y is a man > y interests x) D x interests Socrates). 


Thus it is that quantification theory solves the problem of distin- 
guishing between (3) and (6). * 

Existential quantification consists in applying rather the prefix 
‘(ax)’ [or ‘(ay)’, etc.], which may be read ‘there is an object x [or y, 
etc.] such that’. Thus ‘Some primes are even’ would be rendered: 


(10) (Jx)(x is a prime . z is even). 
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Whitehead and Russell define this sort of quantification in terms of 
the other, by explaining ‘(37)’ as short for ‘~(z2)~’. 

Just as in the simpler part of logic Whitehead and Russell used 
‘np’, ‘q’, ete., to stand in place of statements, so in quantification 
theory they use ‘dz’, ‘vy’, ‘x(x, y)’, etc., to stand in place of expres- 
sions of the kind that I call matrices?*—such expressions as ‘x 
interests Socrates’, ‘y is a man’, ‘x interests y’. To the axioms 


hitherto noted, two more are now added: 


(x)ox D oy, (x)(p V ox) D (p V (aba), 


A hundred theorems are derived, by use of the two deductive pro- 
cedures hitherto noted and three more: substitution for complexes 
such as ‘oz’, relettering of variables used in connection with quan- 
tifiers, and introduction of new initial quantifiers. The new rule of 
substitution, like the old one, is left tacit; and so is the rule of re- 
lettering. Correct formulation of these rules, particularly that of 
substitution, is not easy.”° 

In its main lines this theory of quantification follows Frege’s. 
Peano’s methods, though adequate to the same purposes, differed 
in essential respects and were not as elegant. Whitehead and Rus- 
sell’s notation, set forth above, is more expedient typographically 
than Frege’s, but Frege’s exposition was more precise. 

Embedded within the midst of quantification theory, as rendered 
by Whitehead and Russell, there is an inconspicuous detail that em- 
bodies the germ of a Platonic ontology of universals. This detail is 
properly inessential to quantification theory itself, but in some 
form it is essential to any foundation of classical mathematics. It is 
just this: ‘¢’, ‘y’, etc., are allowed to occur in quantifiers. The sig- 
nificance of this detail will be clarified. 

Let us consider again the letters ‘p’, ‘q’, etc. These belong to 
a schematism for diagrammatically depicting the patterns of 
compound statements. The letters enter the diagrams ‘p) q’, 
‘~(p.~p)’, etc., as dummies for statements; they occupy posi- 
tions that would be occupied by actual statements within any actual 
compound statement of the depicted pattern. Analogously, the 


23 T am passing over the alternative development proposed in *9 of Principia. 
The main point of this alternative is in relation to the ramified theory of types, 
discussed in §7 below. 

24 Mathematical Logic, 71ff. 

25 For a formulation of the rule of substitution see Hilbert and Ackermann, 
2d ed., 56f. The attempted formulation in the 1st ed., 53, was wrong. For the 
rule of relettering, see my Mathematical Logic, §21. 
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complex variables ‘da’, ‘dy’, ‘yx’, ‘x(x, y)’, etc., enter the diagrams 
as dummies for matrices; they are written in complex form only to 
remind us, e.g., that ‘dx’ and ‘dy’ supplant matrices that are alike 
except for their component variables ‘x’ and ‘y’. Expressions in- 
volving dummies ‘p’, ‘gq’, ‘dz’, etc., are from this point of view not 
statements, nor matrices, but diagrams or schemata; such in par- 
ticular are the six axioms noted above. Moreover the variables ‘p’, 
‘q’, etc., used thus in schemata, need not refer to objects of any sort 
as their values; nor do ‘dz’, ‘vy’, etc. The statements and matrices 
that these variables supplant need not be viewed as names of any- 
thing at all (cf. §3). 

The letters ‘x’, ‘y,’ etc., themselves, fundamentally different in 
status from ‘p’, ‘q’, ‘dx’, etc., turn up not merely in schemata but 
also in the matrices themselves and even in actual statements—e.g. 
(7)—(10). These letters, adjuncts to the quantification notation, 
must be recognized as ordinary variables referring to objects. The 
context (10), e.g., is read ‘There is something x such that 2¢ is prime 
and even’; and (7) is read ‘Whatever entity x may be, if z¢ interests 
every man then zt interests Socrates.’ The letter ‘x’ does refer, in its 
pronominal fashion, to each of the entities of our universe—what- 
ever those may be.”é 

The effect of letting ‘¢’, “y’, etc., occur in quantifiers, now, is that 
these letters cease to be fragments merely of dummy matrices” 
‘px’, ‘py’, etc., and come to share the genuinely referential power of 
‘x’, ‘y’, etc. They must now be regarded as variables in their own 
right, referring to some sort of abstract entities, perhaps attributes, 
as their values; and they are eligible now to occur in genuine matri- 
ces and statements.”® One such statement is ‘(3¢)(x)¢x’, which may 
be read ‘There is an attribute ¢ such that, no matter what x may be, 
x has the attribute ¢.’ The juxtapositive notation ‘¢z’, no longer an 
indissoluble dummy matrix, comes to express the attribution of an 
attribute ¢ on the one hand to an object x on the other. ‘d(z, y)’, 
similarly, comes to express attribution of a dyadic or relative attri- 
bute ¢ to the respective objects xz and y. 


26 Cf. my “‘Designation and existence.”’ 

27 My technical term for these is ‘atomic matrix frames’. In general, dia- 
grams or schemata for statements and matrices are in my terminology frames. 
The ‘¢’ occurring as a fragment of ‘dz’, ‘dy’, etc., I call a predicate variable; 
and ‘p’, ‘g’, etc., I call statement variables. Variables such as ‘z’, ‘y’, etc., that 
can occur in quantifiers and hence in actual matrices and statements, I call 
pronominal variables. See my Elementary Logic (1941), §§14, 33, 42. 

8 T.e., the letters ‘¢’, ‘y’, etc., cease to be predicate variables and become 
pronominal variables whose values are attributes; ef. preceding footnote. 
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In putting ‘¢’, ‘y’, etc., in quantifiers Whitehead and Russell leap 
from quantification theory to a theory of attributes—a theory in- 
volving over and above quantification the notion of attribution. 
But this new notion is never explicitly recognized. The relevant 
distinctions are blurred by use of the phrase ‘propositional function’ 
to refer indiscriminately both to expressions of the kind that I have 
called matrices and to objects of the kind that I have called attri- 
butes. 

It is properly on the theory of attributes, rather than that of 
quantification, that Whitehead and Russell’s definition of identity 
is rested; for ‘x = y’ is defined with help of quantified ‘¢’ as 
‘(6)(¢z D oy)’, i.e., ‘Every attribute of x is an attribute of y’. The 
theory of identity is one of the many branches of logic and mathe- 
matics that are successively developed from the foregoing logical 
basis, in the course of Principia, by adoption of appropriate defini- 
tions. The logical basis itself remains as above; further primitive 
equipment is never added. 

The next branch of logical theory developed in Principia is that 
of descriptions. The notation of description consists in applying the 
prefix ‘(.2)’ [or ‘(.y)’, etc.] to a matrix in order to designate the one 
and only object z [or y, etc.] that satisfies the matrix; e.g., (2) (a? = 
8) is the one and only number whose cube is 8, viz., 2. Notations to 
this effect were used by both Frege and Peano, but Russell was the 
first (1905, 1908) to define the device in terms of more basic no- 
tions. His definition, used in Principia, is contextual: a description 
is explained not in isolation but as part of a broader context, which 
is defined as a whole. A whole context ‘Y(:.2)¢x’—to use the sche- 
matic notation—is explained as an abbreviation for: 


(Jy) (x) ((2@ = y = or) . vy). 


If there is one and only one object x such that ¢x—symbolically, 
if (Ay)(z)(a = y = ¢2), or briefly E!(.7)¢z—then the description 
‘(1z) 62’ as contextually defined comes to behave as if it designated 
that one object; and in other cases the context of the description 
simply becomes false. To speak of “the context’ here is really 
ambiguous, since a single occurrence of a description commonly 
has many contexts of varying lengths; conventions are added, how- 
ever, to resolve the ambiguity. 


297 omit a complication involving the notion, discussed in §7 below, of 
“predicative function.”’ 
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In Principia the prefix ‘é’ (or ‘g’, etc.) is applied to a matrix to 
designate the class of all objects x (or y, etc.) satisfying that matrix. 
The class of all sons of lawyers, e.g., is 


£(3y)(y is a lawyer .z is son of y). 


This device of abstraction of classes had been used, under different 
notations, by Frege and Peano. A companion notation, due to 
Peano, is that of membership: ‘x € a’, meaning that x is a member of 
the class a. Thus the combination: 


ze &(dy)(y is a lawyer . x is son of y) 
amounts merely to: 
(3y)(y is a lawyer .z is son of y). 


Any instance of the schema: 


(11) (z2e£6x) = $2 


evidently holds in similar fashion. 

But what sort of things are these classes? Not mere collections, 
or aggregates, in any concrete sense. A class a of concrete things is 
by no means identifiable with the concrete total object ¢ made up 
of those things. On the contrary, a is not even determined by ¢; for 
a might have as members either many small parts of ¢ or a few 
large ones. A class is an abstract entity, a universal, even if it 
happens to be a class of concrete things. 

To the nominalist temper, accordingly, the elimination of classes 
in favor of expressions is a congenial objective; and this is what 
Russell is sometimes believed® to have done in 1908 when he 
showed how contexts ostensibly treating of classes could be con- 
strued as abbreviations of other expressions wherein reference is 
made only to ‘‘propositional functions.’’ These constructions re- 
appear in Principia; membership, abstraction, even quantifica- 
tion with respect to class variables ‘a’, ‘8’, etc., are introduced by 
contextual definition as notational abbreviations. The principle 
(11) is deduced, in view of these definitions, from the antecedent 
logical basis. 

3 See, e.g., Hahn, 22, 
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So long as ‘propositional function’ is thought of in the sense of 
‘matrix’, such a construction would seem to serve its nominal- 
istic objective; but actually Russell’s construction involves use of 
‘p’, ‘Wy’, etc., in quantifiers, and hence calls for propositional func- 
tions in the sense rather of attributes. To have reduced classes to 
attributes is of little philosophical consequence, for attributes are 
no less universal, abstract, intangible, than classes themselves. 

Indeed, there is no call even to distinguish attributes from 
classes, unless on this one technical score: classes are identified 
when they coincide in point of members, whereas it may be held 
that attributes sometimes differ though they are attributes of just 
the same things. It is precisely this difference, in fact, and nothing 
more, that Russell’s contextual definition of classes accommodates; 
his is a technical construction enabling us to speak ostensibly of 
identical classes by way of shorthand for discourse about coincident 
but perhaps non-identical attributes. 

Such definition rests the clearer on the obscurer, and the more 
economical on the less. Classes are more economical than attributes 
because they are scarcer: they coalesce when their members are the 
same. Classes are clearer than attributes because they have a rela- 
tively definite principle of individuation: they differ from one 
another just in case their members differ, whereas attributes (if 
they diverge from classes at all) differ from one another also under 
additional circumstances whose nature is left, in Principia, quite 
unspecified. Furthermore any attempt to specify those additional 
circumstances may be expected eventually to run afoul of the rule 
of ‘putting equals for equals,”’ analogously to what was observed 
earlier (§4) in the case of non-truth-functional statement composi- 
tion. 

In any case there are no specific attributes that can be proved 
in Principia to be true of just the same things and yet to differ 
from one another. The theory of attributes receives no applica- 
tion, therefore, for which the theory of classes would not have 
served. Once classes have been introduced, attributes are scarcely 
mentioned again in the course of the three volumes. 

The clear course would have been to introduce the membership 
notation ‘za’ at an earlier point, as a primitive notation taking 
the place of the juxtapositive notation ‘¢z’ insofar as the latter is 
used in its ill-recognized attributional sense. Thus, e.g., ‘(Ja)(x) 
(x € a)’ would supplant ‘(3¢)(x)¢z’. The notation of class abstrac- 
tion would still admit of definition, viz., by explaining ‘¢¢z’ as 
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short for ‘(1a)(r)(x7 ea = o2)’.*! Just one axiom schema would 
have to be added, viz., ‘E!é¢z’;32 whereupon (11) and all other 
principles of the Principia theory of classes would be forthcoming. 
The notation ‘¢z’ might survive in the status of dummy matrix for 
expository convenience (as here), but ‘¢’ would never be isolated 
in quantifiers nor used in actual statements and matrices. The 
notion of attribute, insofar as it diverges from that of class, would 
not occur. 

The course proposed would have been a little closer to Peano 
and Frege, and its adoption would have obviated some of the 
vaguest stretches of Principia. Perhaps it would have been adopted 
if the use of the phrase ‘propositional function’ indiscriminately 
for matrices and for attributes had not obscured essential cleavages. 

The contextual definition of classes is repeated in Principia for 
the dyadic case. Here, instead of classes properly so called, we have 
“relations in extension”’—classes, substantially, of ordered pairs. 
Parallel to the notation ‘x € a’ of class membership, there is in the 
case of relations the notation ‘rRy’ to the effect that x bears the 
relation FR to y. Abstraction, in the case of relations, is expressed by 
double prefixes such as ‘#7’. 

The reform proposed above for the theory of classes carries over 
to the theory of relations in strict analogy. ‘éy¢d(a, y)’ comes to be 
explained as short for: 


OR) (x) (y)leRy = o(2, y)I, 


while ‘cRy’ comes to figure as a primitive notation supplanting 
the attributional version of ‘¢(z, y)’. An axiom schema analogous 
to ‘Elé¢x2’ has to be added, viz., ‘El¢9¢(z, y)’. 

This reform of the foundations does not affect the ensuing con- 
structions. In the course of these constructions Boolean algebra is 
reconstructed in its classial interpretation, by defining the comple- 
ment —a of aclassaas¢~ (rea), the product of classes a and B 
as £(vea.xef), and the sum as £(rea V xe 8). The universal 
class, designated by ‘V’, is defined as #(z = x); and the null class A 
is defined in turn as —V. An exactly parallel algebra of relations is 
also developed, wherein the complement —R of a relation FR is 
defined as #9 ~ (zRy), the product of relations Q and R as 
£9(cQy .xRy), and so on. Theorems are deduced at considerable 


| 31 Substantially this definition was used by Whitehead on a later occasion 
1934). 
82 Cf. §5, above. This schema combines P1 and R3 of my ‘‘New foundations.”’ 
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length in the class algebra, and then parallels are listed for relations. 

This wasteful duality would have been avoided if, like Peano, 
the authors had treated relations quite literally as classes of ordered 
pairs. The notion of an ordered pair x;y could be adopted as primi- 
tive, as in Peano, whereupon ‘xRy’ could be explained as short for 
‘z;y € R’. Relational abstraction, correspondingly, could be reduced 
to class abstraction by explaining £7¢(z, y) as 


2 (ax) (Ay) (2 = xy . o(@, y)). 
Such of relation theory as parallels general class theory then comes, 
in large part, simply to be absorbed into the latter. Incidentally 
the axiom schema ‘E!£/¢(z, y)’, noted above, becomes dispensable 
in favor of: 


(x;y = 2;w) D> (w@=2z.y = w). 
And one of the advances in logic since Principia is the discovery 
by Wiener that this latter axiom and the primitive notion of 


ordered pair that it governs can be eliminated in turn. The ordered 
pair is satisfactorily definable on the basis of class theory alone. 


ih 


Russell was the first to discover that the principle (11), if ad- 
mitted without special restrictions of one sort or another, leads to 
contradiction. If in particular we put ‘a’ for ‘x’, ‘~(aeqa)’ for 
‘px’, and ‘8 ~ (ae a)’ for ‘z’, we get as an instance of (11) the self- 
contradictory statement: 


(12) [@~ (aea) ea~ (Rea)] = ~[8 ~ (aea) e€a~ (aea)]. 


Both Frege’s logic and Peano’s turn out, in this way, to be involved 
in contradiction. 

Russell’s remedy is his theory of types. Every class is conceived as 
belonging to one and only one of a hierarchy of so-called types; and 
any formula that represents membership as holding otherwise than 
between members of consecutive ascending types is rejected as 
meaningless, along with all its contexts. In particular, thus, ‘a € a’ 
and all its contexts are meaningless. Such is the status of (12), 
which thus ceases to count as an instance of (11). (12) is only one 
of infinitely many contradictions derivable from (11); but all these, 
so far as is known, are banished like (12) by the theory of types. 
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Relation theory, though, involves analogous contradictions, so 
Russell has to extend the theory of types to cover relations as well 
as classes. The theory here is more complicated; the two degrees of 
freedom implicit in the ordered pair call for more than a simple 
hierarchy of types. The theory of “‘propositional functions,” insofar 
as it makes use of ‘¢’, ‘Y’, etc., in quantifiers, is of course no less 
liable to the contradictions than are class theory and relation 
theory; and from the point of view of Principia it is actually not to 
classes and relations but to “propositional functions” that the basic 
version of type theory applies. From this equivocal realm the 
theory is transmitted to classes and relations through the con- 
textual definitions of these latter. 

As applied to propositional functions the theory undergoes a 
special complication: a hierarchy of orders is superimposed, such 
that propositional functions may differ as to order even though they 
be of the same type. This ramification of type theory proves to 
subject logic and mathematics to certain intolerable restrictions, 
for the removal of which an axiom of reducibility is adopted. Accord- 
ing to this axiom, every propositional function is satisfied by 
exactly the same arguments as some propositional function which is 
predicative, 1.e., which has the lowest order compatible with its 
type. 

This ramification of type theory is designed for the avoidance 
of certain contradictions of a quite different sort from (12). But 
the treatment is vague, on account of failure to distinguish be- 
tween expressions and their names. On restoring this distinction 
one finds that the contradictions against which this part of type 
theory was directed are no business of logic anyway; they can 
arise only in discourse that goes beyond pure logic and imports 
semantic terms such as ‘true’ or ‘designates’. The whole ramifica- 
tion, with the axiom of reducibility, calls simply for amputation.® 

It is readily seen also on other grounds that this part of type 
theory was bound to be wholly idle. The axiom of reducibility 
assures us that from the beginning we could have construed the 
notations of Principia as referring exclusively to so-called pred- 
icative ‘‘propositional functions” (predicative attributes); but 
when this is done, the resulting logic is the same as if neither 
“orders” nor ‘“‘predicativity” nor ‘“‘reducibility” had been thought 
of in the first place.** That this simple situation escaped the atten- 


33 Cf. Ramsey, 20-29. 
% Cf. my paper ‘‘On the axiom of reducibility.’’ 
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tion of the authors is attributable, again, to the ambiguity of 
‘propositional function’ and the underlying difficulty over use 
and mention. 

The residual unramified type theory, as applied to classes and 

relations, is actually the only form of type theory that figures even 
in Principia after the first couple of hundred pages; for, as re- 
marked, ‘‘propositional functions’ are soon submerged. But even 
this simpler version of type theory is a source of much complexity 
of technical detail. 
Because the theory allows a class to have members only of uniform type, 
the universal class V gives way to an infinite series of quasi-universal 
classes, one for each type. The negation —a ceases to comprise all non- 
members of a, and comes to comprise only those non-members of a which 
are next lower in type than a. Even the null class A gives way to an infinite 
series of null classes. The Boolean class algebra no longer applies to classes 
in general, but is reproduced rather within each type. The same is true 
of the calculus of relations. Even arithmetic, when introduced by defini- 
tions on the basis of logic, proves to be subject to the same reduplications. 
Thus the numbers cease to be unique; a new O appears for each type, 
likewise a new 1, and so on, just as in the case of V and A. Not only are all 
these cleavages and reduplications intuitively repugnant, but they call 
continually for more or less elaborate technical manoeuvres by way of 
restoring severed connections.* 

In particular, an awkward effect is obstruction of the proof 
that n ¥ n + 1 for all finite n; this principle comes rather to be 
cited as hypothesis where needed (ef. §9), and Whitehead and 
Russell avoid it whenever they can at the cost of more circuitous 
proofs. In one way and another the theory of types accounts for 
perhaps a fifth of the page-count of the three volumes. 

In 19384 Whitehead presented an alternative foundation for 
arithmetic, designed both to obviate the reduplication of numbers 
and to render ‘n ~ n + 1’ demonstrable in straightforward logical 
fashion. His method, though, is costly; for it turns essentially on 
the use of statements in non-truth-functional contexts (cf. §4). 
For all its cost, moreover, the method continues to presuppose the 
theory of attributes (using ‘¢’, ‘y’, ete., in quantifiers); and in 
that domain the theory of types remains intact. 

Whatever the inconveniences of type theory, contradictions 
such as (12) show clearly enough that the previous naive logic 
needed reforming. The theory of types (as applied to classes 


% From my ‘‘New foundations.” 
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and relations, hence without the ramus amputandus) remains 
one of the important proposals for a reformed logic. There have 
been other proposals to the same end—one of them even coeval 
with the theory of types.** None of these other proposals in- 
volves that reduplicative situation whose irksomeness, for White- 
head among others, has been noted; and not all of these proposals 
obstruct the proof that n ~ n + 1.# 

But a striking circumstance is that none of these proposals, 
type theory included, has an intuitive foundation. None has 
the backing of common sense. Common sense is bankrupt, for 
it wound up in contradiction. Deprived of his tradition, the 
logician has had to resort to mythmaking. That myth will be 
best that engenders a form of logic most convenient for mathe- 
matics and the sciences; and perhaps it will become the common 
sense of another generation. 


In the remainder of Part I numerous special notions having 
to do with relations are defined and hundreds of theorems are 
derived. One important notion is R“a, the class of all objects 
bearing R to any members of a. The definition is obvious: 
#@y)(y «a. xRy). Another is the relative product R | S, defined as 
£2(dy)(aRy .ySz). Another is R, the converse of R, defined as 
gé(aRy). These notions had figured prominently in the early work 
on relation theory by Augustus De Morgan and C. 8. Peirce. 
But these pioneers did not distinguish explicitly between the 
first two of the notions; Frege was the first to do so. 

Another important notion is R‘x, defined as (1y)(yRz). So 
long as one and only one thing bears FR to x, that thing is R‘z. 
If in keeping with modern trends we explain a function in gen- 
eral as a one-many relation, i.e., a relation that no two things 
bear to the same thing,® then the notation ‘R‘z’ may be spoken 
of as that of functional application; where R is a function, R‘r 
is the value of R for the argument z. 


% Zermelo, 1908. 

77 In my Mathematical Logic, e.g., this is readily deducible from {677 (p. 252) 
with help of mathematical induction. 

% Cf. Peano, 1911. 
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. e = <= e 
Special functions D, C, R, and R are then defined, in such a 


way that D‘R, C‘R, Rc, and Rix turn out to be respectively 
£(ay)(aRy), g(ax)(«Ry), G(yRz), and ¢(zRy). But this and other 
portions of Princtpia would have been shorter and better if every 
new notation had been required to pay its way or suffer deletion. 
D‘R and C‘R, called the domain and converse domain of R, could 
have been expressed quite as briefly in terms of previous notions, 
without further definition; viz., as R“V and R“V. Also, given 
Peano’s notation ‘tz’ for the class ¥(y = x) whose sole member is 


x, the notations Rx! and Ree’ could have been dropped in favor 
of Raz and Rix. Use of ‘RV’, ‘RV’, ‘Rix’, and ‘R“uw aweulal 
have called for very few special theorems, over and above the 
general ones governing ‘R“‘a’. On the other hand the adoption of 
‘D‘R’ and the rest calls for chapters of additional theorems by 
way of relating these new notations to old ones and to one another; 
and thereupon we find ourselves manipulating ‘D‘R’, ‘d‘R’, etc., 
according to the new special laws, neglecting general laws about 
‘Ra’ that would have served as well. It is important to remember 
that algorithmic power turns not on assorted occurrences of many 
signs, but on repeated occurrences of a few. 

Functional application, under a different notation, played a 
prominent role in Frege’s logic; and so did a companion notion 
of functional abstraction. This latter consists in applying the 
prefix ‘A,’ (or ‘A,’, etc.)*® to a term containing ‘x’ (or ‘y’, etc.), 
in order to designate the function whose value, for any argument 
x, is the object referred to by the original term; thus A,(z*) is 
the function “cube of,” i.e., the function whose value for any 
argument x is x*. Functional abstraction is related to functional 
application precisely as class abstraction is related to member- 
ship. Just as membership cancels class abstraction (cf. (11) ), 
so application cancels functional abstraction; .(z*) ‘y, eg., is 
y®. Functional abstraction is readily introduced on the basis of 
relational abstraction; \.(x°), e.g., is simply yf(y = 2°). 

The fact that functional abstraction was not carried over 
into Principia accounts for a good deal of inelegance. A special 
definition is adopted, e.g., explaining ‘R,.’ as short for ‘Ba(8 = 
Ra)’; whereas ‘\.(R“‘a)’, if available, would have served instead 
of that special sign. The devious course of defining ‘D’ in isolation 


* I depart from Frege’s notation in favor of Church’s, which is more familiar 
nowadays. 
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as ‘Okla = £(ay)(xRy))’, instead of defining ‘DR’ outright as 
‘¢(dy)(xRy)’ (or using ‘R“V’ instead), is another consequence 
of the lack of functional abstraction. The course was prompted 
by the need of D, on rare occasions, as a function in its own night; 
but if functional abstraction had been at hand, the infrequent 
need of ‘D’ in isolation would have been served by ‘ArDR’ (or 
‘\r(R“V)’). The situation is precisely similar with ‘A’, ‘RP, and 
'R’ and the situation is similar also with ‘’, for Whitehead and 
Russell introduce Peano’s ‘x’ as ‘*x’ in parallel fashion to ‘D‘?’ 
and the rest. Profitable use might have been made of functional 
abstraction at many other places in Principia as well.” 


9 


Classes are said to have the same cardinal number just in case 
they are respectively the domain and the converse domain of a 
one-one relation; i.e., of a function (in the sense lately explained) 
whose converse is a function. Where e@ and 8 are any mutually 
exclusive classes whose respective cardinal numbers are p» and », the 
arithmetical swm of uw and »v is defined as the cardinal number 
of the logical sum of a and £; the arithmetical product of u and 
y is the cardinal number of the class of all pairs 7;7 such that 7 € @ 
and j « 8; and yu to the power v is the cardinal number of the class 
of all functions having 6 as converse domain and part or all of @ 
as domain. A number is infinite if it is at once the cardinal number 
of distinct classes a and 6 such that aC 8, i.e., such that (z) 
(cea > xe 8); otherwise finite. 0 is defined as the cardinal number 
of the null class; 1 is the cardinal number of classes of the form cz; 
and each further finite cardinal is expressible in terms of 1 and 
arithmetical addition. The first infinite cardinal, No, is defined as 
the cardinal number of the class of all finite cardinals. The ordering 
of cardinals is fixed by the stipulation that u S v wherever pu and v 
are the cardinal numbers respectively of classes a and 8 such that 
eC £B. 

Such are the elementary constructions of Cantor’s theory of 
cardinal numbers. This much was set forth explicitly by Peano 
in his logical notation.*+ But note that the cardinal number of 


0 Notably Vol. 1, *88 passim; Vol. 2, *150-01-02. 
411901, 70-72. 
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a, symbolically Num a, has been defined only in the context 
‘Num a = Num @’; not in isolation. Actually Num a could very 
easily be defined in isolation, viz., as the class of all domains of 
one-one relations having a as converse domain; and such, years 
earlier, was Frege’s course.*? But Peano rejected this course, 
asserting groundlessly that Num a and the class in question 
‘font des propriétés différentes.’ 

In early sections of Whitehead’s 1902 paper ‘On cardinal 
numbers,”’ Peano’s formulation of Cantor’s constructions is re- 
produced with certain emendations attributed to Russell. Here 
Frege’s definition of cardinal number that Peano had rejected 
is adopted. Here also we find a definition of the class of finite 
cardinals that goes back to Frege; 1t turns not on the criterion 
indicated above, but rather on the idea of accessibility from 0 
by successive additions of 1. It goes into the Principia notation 
thus: 


R(x) [v~)vex Dv +lex) D OexD vex). 


Russell had learned it through Dedekind, being ignorant still of 
Frege when this paper went to press. 

In this paper Whitehead redefines the power yp” as the number 
of ways of picking exactly one member from each of v mutually 
exclusive classes having » members apiece; and he derives the 
old definition as a theorem. Further, he defines the sum and the 
product of any class of cardinals—covering the case where the 
cardinals belonging to the class are not merely infinite but infinitely 
numerous. He proves various familiar arithmetical theorems, in 
extension to infinite cardinals; the most striking case, perhaps, is 
the binomial theorem, whose explicit formulation alone is no small 
undertaking. 

The proofs of various of the theorems in this paper and its 1904 
sequel depend on two hypotheses: (1) every infinite class is ex- 
haustively resoluble into parts having No members apiece, and (2) 
of any two cardinal numbers one exceeds the other. Both are reduc- 
ible to a far more basic hypothesis, Zermelo’s famous axiom of 
choice; but this was for the future, for the axiom itself dates only 
from 1904. 

The definitions in the two Whitehead papers tend to be less 


421884 79-85; 1893, 56f. 
431901, 70. 
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rigorous than in Peano. Many of them omit essential quantifiers.“ 
In the definition of the number 1 there is also a further defect, 
whereby the null class comes inadvertently to be admitted as a 
member of 1.“ There is a notation of multiplication that receives no 
definition, though it appears in two theorems.* In general the 
proofs of theorems are sketchy; and in particular I have been un- 
able to decipher the purported proof that n # n + 1 for finite n.” 

Still the two papers were highly significant, affording as they did 
the first considerable development of infinite arithmetic within 
mathematical logic. Cantor and others had used no logical formal- 
ism; Peano had deduced none but a few trivial theorems in infinite 
arithmetic; and Frege, though deducing the laws of finite arithmetic 
in rigorous detail, did little with infinite numbers. 

Principia was in progress, and these two papers were early prog- 
ress reports. Subsequent progress was both qualitative and quanti- 
tative. In the finished Principia, Parts II-III, the concepts of 
cardinal arithmetic are constructed more elegantly than before and 
a good level of rigor is maintained. The axiom of choice is now used 
explicitly as a hypothesis where needed; and so is another principle, 
the so-called axiom of infinity, which for Principria is equivalent to 
the troublesome law that n ~ n + 1 for finite n (cf. §7). With and 
without the help of these hypotheses, hundreds of theorems on 
finite and infinite cardinals and ancillary topics are deduced. In 
Part V, Cantor’s theory of finite and infinite ordinal numbers is 
developed, followed by remnants of cardinal arithmetic that depend 
on the theory of ordinals. In point both of rigor and of comprehen- 
siveness, these portions of Principia remain the authoritative work 
on cardinals and ordinals. 

The theory of ordinals is part of that of ordered sets; for, just as a 
cardinal is a class of classes all having the same number of members, 
so an ordinal is a class of so-called well-ordered sets all having the 
same number of members arranged in similar orders. To an ordered 
set, two things matter—the members and their order; so we might 
think of an ordered set as a complex, somehow, of a class a and a 
““before—after’’ relation R among members of a. However, specifica- 
tion of a is superfluous; a is already determined by R, being simply 
the field of FR, i.e., the logical sum C‘R of D‘R and C‘R. Hence the 


“4T.e., essential subscripts; for Peano’s notation is used. 
902,378; also 373. 

= Toed., 381 (4-21-22). 

eMTo7d., 379 (2-1). 
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ordered set can be adequately treated simply by identifying it with 
R; and such is the course adopted by Whitehead and Russell. 
Ordered sets are not identified with relations in general, but with 
those relations R that have the “‘before—after” kind of structure. 
This condition on # breaks up into three: 


Asymmetry: (x)(y) ~ (cRy. yRz), 
Transitivity: (x)(y)(z)[(2Ry . yRz) D zkz], 
Connexity: (x)(y)[(x4eC'R.yeC'R) D («Ry V yRa V x = y)). 


In Principia such a relation RF is called serzal. For well-ordering, yet 
a fourth condition is needed: 


(8)[(8 C RB) D (8 = A)). 


Given all four conditions on R, there is still latitude as regards fur- 
ther details of structure; and the class of all well-ordered serial rela- 
tions that agree with RF in all such further details is taken as the 
ordinal number of R. 

It should be noted that Whitehead and Russell’s identification 
of an ordered set with the relation of “‘before—after”’ therein is some- 
what arbitrary; we could identify it rather with the relation “no 
later than” in the set (thus choosing the analogue of ‘“<” rather 
than that of ‘“<’’). This alternative approach, which is gaining in 
favor, has certain advantages; e.g., it restores the ordinal num- 
ber 1,8 anomalously missing under the Whitehead—Russell method. 

The theory of ordered sets, or series, is broader than that of well- 
ordered sets and ordinals. In general, ordered sets or serial relations 
that coincide in all further details of structure are said to belong to 
the same order type. Ordinal numbers are thus special sorts of order 
types, viz., those of well-ordered series. In Principia order types are 
spoken of rather as serial numbers; but their analogy to numbers is 
slight, for, unlike the ordinals, they do not all fall into a serial order 
of ‘magnitude.’ In the theory of ordered sets generally, what has 
interested mathematicians is not a quasi-arithmetic of order types, 
but rather the notion of the limit of a set and derivative notions. 
This branch of theory, known as analytic set theory, is rigorously 
developed from logic in the course of Part V of Principia. 

Order type or serial number is, we saw, the generalization of 
ordinal number that is reached by waiving the requirement of well- 
ordering. But a still broader generalization—called relateon number 


48 This was called to my attention by Dr. Alfred Tarski. 
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in Principia—is reached by waiving the requirement of seriality in 
turn and considering relations generally. The theory of relation 
numbers is the general theory of isomorphism, 1.e., of structural 
identity among relations. It embraces the theory of ordering and 
of well-ordering, inasmuch as serial numbers and ordinal numbers 
are simply the relation numbers of ordered and well-ordered sets; 
but it also embraces wider theories, e.g., that of partial order. The 
theory of relation numbers occupies Part IV of Principia; serial 
and ordinal numbers appear only afterward, as specializations. 
Emphasis is put on the analogies, insofar as there are such, between 
relation numbers and cardinals; but these relationships would have 
come out somewhat more clearly and simply if relations had been 
treated literally as classes of ordered pairs (cf. §6). 


10 


Part VI, “Quantity,” occupies the last half of the last volume of 
Principia. Attention is here turned to the arithmetic of signed 
integers, ratios, and real numbers. Following Peano, the authors 
construe the signed integers +y and —y in effect as the function 
X,(v + uw) and its converse, where py is any finite cardinal. Arithmeti- 
cal notions such as sum, product, and power, defined hitherto for 
cardinals, are now defined again appropriately for signed integers; 
and a train of theorems is deduced. 

Peano’s way of construing ratios, viz., by identifying yu/» with 
p'v’(u X v’ = yp’ X v) for all finite cardinals » and »v, is abandoned 
by Whitehead and Russell in favor of a version whereby p/» be- 
comes a relation not between cardinal numbers but between rela- 
tions. Roughly, u/» is construed as the relation which Q bears to R 
just in case there are things x and y such that 2 bears Q/Q| . . . 
(v times) and likewise R|R| . . . (u times) to y. Think of Q and R 
as vectors or transformations, applicable over and over; then Q 
bears »/v to R if v applications of Q do the work of yu applications of 
k. (In the definition actually adopted a complication is added 
whose purpose is to minimize the dependence of rational arithmetic 
on the axiom of infinity; but it is a complication engendered merely 
by the theory of types.) 

This version of ratios is interesting in that it makes ratios im- 
mediately applicable to relations generally, arithmetical and other- 
wise. The grandparent relation, e.g., stands in the ratio 2/5 to the 
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great-great-great-grandparent relation. Such applicability proves 
useful in a subsequently expounded theory of measurement. A fur- 
ther virtue of this version is the naturalness with which the series of 
ratios so construed admits of extension to include negatives. The 
negative of a ratio P is defined in effect as P | Ark. The negative of 
the ratio 2/5 obtains, e.g., between the grandparent relation and 
the great-great-great-grandchild relation. 

A rea] number is explained, substantially, as any class of ratios 
that (1) does not contain all the ratios, but (2) contains any given 
ratio if and only if it contains also a higher. (Actually an exception 
is made, inelegantly and gratuitously, in connection with 0.*°) That 
those classes form a model of the traditional real number series was 
first pointed out by Dedekind. In this version of real numbers, the 
ratios alluded to are to be understood of course as just positive and 
zero; and the real numbers thus accounted for are likewise just 
positive and zero. But one can build the negative reals on the nega- 
tive ratios in quite analogous fashion; and Whitehead and Russell 
do so. 

They also develop an attractive alternative construction of real 
numbers, the effect of which is to increase the kinship between real 
numbers and ratios. Here a real number is identified not with a class 
a of ratios, as above, but rather with the relation whose ordered 
pairs are got by pooling all the ratios belonging to such a class a. 
Real numbers so construed are, like ratios, relations between rela- 
tions. 

For real numbers in both senses, as well as for ratios, Whitehead 
and Russell set up the full complement of associated arithmetical 
notions and prove many theorems. Then they turn to the conclud- 
ing topic of Principia, which is measurement. In dim outline, the 
development of this topic proceeds as follows. 

Any one-one relation RF that is repeatable (i.e., such that every- 
thing to which RF is borne bears R to something in turn) is called a 
vector. A class x (A) of vectors all having the same field a, and such 
further that the order of applying any two of the vectors is im- 
material (P|Q = Q|P), is called a vector family of a. Now a magni- 
tude is thought of as a vector; the gram, e.g., is thought of as the 
vector ‘“‘a gram more than.” Kinds of magnitude, e.g., mass, length, 
etc., are certain vector families. Measurement, finally, consists in 
determining ratios between members of some vector family x; and 
here we see the special utility of the adopted version of ratios as 
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relations of relations. In cases where irrational measures are called 
for, use is made not of ratios but of real numbers—which still are 
relations of relations, under the second version. A general theory of 
so-called nets is begun, with a view to the introduction of geometri- 
cal co-ordinates in the projected fourth volume. 

Measurement is ordinarily thought of as a concern of natural 
science; and we should scarcely venture to derive the concepts of 
natural science from pure logic. But we may still venture to formu- 
late the conceptual structure of natural science in purely logical 
terms. Insofar as certain terms, say of physics, are constructible 
from other more basic ones by logical devices, we may reproduce 
the constructions using variables in place of those basic physical 
terms. Along these lines we arrive at purely logical functions that 
make up—more and more exhaustively as we proceed—the logical 
component of natural science. “Application” of these logical con- 
structs to the world consists merely in assigning values, of the ap- 
propriate extra-logical sort, to the variables. ( '€5 7 ¢?4 “7. 

Such is the intended status of the theory of measurement devel- 
oped in Principia. When we take « as a certain physically specified 
class of vectors, we have a rudimentary theory of mass; when we 
take «x as another class, we have a rudimentary theory of length; 
and so on. Physical laws, insofar as they imply relevant differences 
between these various choices of x, will supply differences in detail 
between the theory of mass, the theory of length, etc. 

Application of the general theory in this wise encounters some 
difficulty on the score of the repeatability clause in the definition of 
vector. If we construe the gram, e.g., as the relation “heavier by a 
gram than” as between physical objects, then after a certain point 
the finitude of mass of the universe will obstruct repeatability. If 
we are to take the gram as a vector, it seems we must take it not as 
the described relation between bodies but rather as the relation “a 
gram more than’”’ as between entities that are in turn abstract 
quantities in some sense or other. Such, indeed, is the intention of 
Whitehead and Russell; but then we are left farther from the 
physical application than might have been desired. 

Analysis of the logical structure of natural science had been 
occupying Whitehead as early as 1906, when he published a paper 
“to initiate the mathematical investigation of various possible 
ways of conceiving the nature of the material world.”’ The con- 
structions in that paper are couched in the regular Principia nota- 

% Cf. Principia, Vol. 3, 339. 
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tion, and foreshadow to some degree the projected volume on 
geometry; and a continuation is outlined in Whitehead’s later 
writings®*! under the head of ‘‘extensive abstraction.”’ Other con- 
structions in the 1906 paper go far outside geometry; this was the 
beginning of a quest for the broadest, most basic concepts and 
principles of nature, and in the decades since Principia the quest 
has issued in a metaphysics. 


51 1916; 1919; 1920; 1929. 
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at Logic, Symbolic 


Notations reminiscent of algebra were used sporadically in de- 
ductive or formal logic from Gottfried von Leibniz onward, and 
increasingly in the 19th century (George Boole, Augustus De- 
Morgan, William Stanley Jevons, Charles Sanders Peirce). Gottlob 
Frege, in 1879, supplanted the algebraic notation by another which 
better accentuated the structural traits most germane to deduction. 
Flourishing like other happily formulated branches of mathematics, 
deductive logic soon so exceeded its earlier powers as to invite re- 
christening: whence ‘“‘symbolic” or “‘mathematical”’ logic. 

If in a sentence we put dummy letters for all portions other than 
the logical particles ‘or’, ‘and’, ‘not’, ‘if’, ‘every’, ‘some’, and the 
like, we get roughly what may be called a logical form. A form is 
called valid if all sentences having that form are true. The logical 
truths, finally, are the sentences valid in form; and the concern of 
deductive logic, or symbolic logic, is systematic recognition of logi- 
cal truths. 

When a logical truth has the form of a conditional (‘If .. . 
then . . .’), the one clause is said to imply the other logically. 
Implication underlies the practical use of deductive logic, namely, 
deduction: the inferring of sentences, on any subject, from any 
supposed truths which logically imply them. 

The above description is inexact, mainly because of the imperfect 
inventory of the logical particles. A more adequate accounting of 
these particles will emerge as we get on with the substantive theory 
of symbolic logic in the following pages. 

This article was written in 1954 and is reprinted from Encyclopedia Americana, 


1957 and later editions, by permission. The Encyclopedia reserves all rights to 
reproduction of this article elsewhere. 
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1. Truth-Function Logic, or Propositional Calculus. The conjunc- 
tion of any sentences is asentence compounded of them by juxtaposi- 
tion, let us say, and construed as true if and only if the component 
sentences are all true. The alternation of any sentences, formed by a 
connective ‘\V’, is false if and only if the components are all false. 
The negation of a sentence, formed by writing ‘—’ before or above, is 
true if and only if the sentence itself is false. Thus juxtaposition, 
‘Vv’, and ‘—’ may be read ‘and’, ‘or’, and ‘not’, though in abstrac- 
tion from such verbal usage as may diverge from the rigid truth 
conditions just now stated. 

Thus, let ‘p’, ‘g’, and ‘r’ be dummy sentences. Then ‘pq V pg V 
pf will be true if and only if at least one of these three combinations 
is realized: ‘p’ and ‘q’ both true, or both false, or ‘p’ true and ‘7’ 
false. 

A compound sentence is said to be a truth function of its compo- 
nent sentences if the truth value (truth or falsity) of the compound 
is determined by the truth values of the components; that is, if no 
substitution of truths for the component true sentences and false- 
hoods for the false ones will alter the truth value of the compound. 
Obviously any compound is a truth function of its components if 
built up of them by conjunction, alternation, and negation. It is 
easily shown also, conversely, that conjunction, alternation, and 
negation provide a complete notation for the truth functions. 

Alternation can even be dropped, since ‘p V q’ can be rendered 
‘-(pq)’. But the threefold notation is convenient. It enables us to 
render any truth function of given components ‘p’, ‘q’, etc., in the 
perspicuous alternational normal form; that is, as an alternation of 
conjunctions of elements from among ‘p’, ‘p’, ‘q’, ‘Gg’, etc. Transfor- 
mation into that form is accomplished by distribution according to 
three laws: ‘“-(s Vt V ... )’ becomes ‘3f . . .’; ‘-(st . . . )’ be- 
comes ‘§ Viv ...’; and ‘r(s Viv ...)’ becomes ‘rs V rt V 
.... (Terminological note: we speak of conjunction not only of 
many components but also of one, thus counting anything a con- 
junction of itself, and similarly for alternation; so ‘p’, ‘pq’, ‘Db V q¢’, 
etc., count as in alternational normal form.) 

Everything said in the foregoing paragraph remains true when 
we systematically switch the roles of alternation and conjunction. 
Thus any truth function can be put also into a conjunctional normal 
form. This so-called duality between alternation and conjunction 
rests on the fact that their truth conditions are alike except for a 
systematic interchange of ‘true’ and ‘false’. 
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The compounds ‘f V q’ (or equivalently ‘-(pq)’) and ‘pq V pg 
are commonly rewritten ‘p> q and ‘p =q’ respectively, and 
called the (material) conditional and biconditional, and read ‘if p 
then q’ and ‘p if and only if q’. Though they mirror only imperfectly 
the vague idioms ‘if-then’ and ‘if and only if’, they suffice for much 
of the business of those idioms. Eked out by ‘‘quantification”’ (see 
below), they suffice for more of it still—indeed for all of it but the 
questionable business of the subjunctive or contrary-to-fact condi- 
tional. The importance of truth-function theory is due largely to the 
connective ‘—’ and its intimate connection with logical implication, 
namely this: one sentence implies another if and only if the material 
conditional formed from the two sentences is logically true. The 
connective ‘=’ is similarly related to logical equivalence, or mutual 
implication. 

In order to avoid excessive parentheses, dots are used to mark 
major breaks in formulas. A dot, when used instead of mere juxta- 
position to express conjunction, marks a greater break than ‘V’, 
‘>’, or ‘=’; any of these connectives with a dot added marks a 
greater break still; a double dot marks a greater break still; and so 
on. Thus ‘pV.pVq.pqaVr:_) p’ stands for ‘{p V [(p V g) 
tag / r)|} D 7’. 

We can evaluate (that is, determine the truth value of) any truth 
function of ‘p’, ‘q’, etc., for any given assignment of truth values to 
‘p’, ‘q’, etc., by applying the truth conditions for conjunction, alter- 
nation, and negation step by step. Validity of a formula of the logic 
of truth functions (for example, ‘pq V pr V fr V js V Gr V 78’) 
is therefore decidable by tabulating all possible assignments of 
truth values to letters of the formula and evaluating the formula 
for each. Such is the method of truth tables (which in practice admits 
of certain shortcuts). 

Implication between formulas of truth-function logic is likewise 
decidable by truth tables; for one formula implies another if the 
corresponding conditional is valid. Equivalence is then decidable in 
turn by two implication tests. Or, more simply, we may test two 
formulas for equivalence by constructing a truth table for each and 
then comparing the truth tables to see if they are the same. 

Simple examples of implication in the logic of truth functions are 
these: ‘pq’ implies ‘p’; ‘p’ implies ‘p V q’; ‘p.p D q’ implies ‘q’. 
Each, characteristically, repeats ‘p’ and so could quickly engender 
fallacy if a sentence substituted for ‘p’ were capable of being true at 
some points and false at others within the same train of logical 
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argument. To avoid this fallacy of equivocation we do not need to fix 
all ambiguities in the sentences to which logic is applied, nor all 
references of pronouns and demonstratives; but we do need to fix 
such of them as might otherwise vary by force of context within the 
space of the argument. 

Truth-function logic was pursued by the Stoics and by Petrus 
Hispanus and William of Ockham. Reappearing as a variant of the 
algebra of classes (next topic), 1t was perfected by Peirce, Frege, 
Ernst Schréder, and Emil L. Post. 


2. Algebra of Monadic Predicates or Classes. General terms or 
predicates are roughly nouns, adjectives, and verbs. This gram- 
matical trichotomy is indifferent to logic; a more germane distinc- 
tion is that between absolute general terms or monadic predicates 
(for example, ‘man’, ‘red’, ‘thinks’) and relative general terms or 
polyadic predicates (for example, ‘uncle of’, ‘greater than’, ‘takes’). 
Sentences are true or false; monadic predicates are true or false of 
objects; dyadic predicates are true or false of pairs of objects; and 
so on. Accordingly the truth functions, which apply to sentences, 
have analogues—called Boolean functions—applicable to predi- 
cates: a conjunction (“logical product’’) of predicates is true of 
whatever the predicates are jointly true of; an alternation (“logical 
sum/’’) is true of what at least one of the predicates is true of; and a 
negation is true of what the predicate is false of. 

In the algebra of monadic predicates, also called class algebra, we 
build Boolean functions of dummy predicates ‘F’, ‘G’, ete. Also two 
special predicates emerge, ‘V’ and ‘A’, true of everything and 
nothing. Joining any such expressions by ‘=’, we get Boolean equa- 
tions. An appropriate reading of ‘F = G’ in general is ‘All F are G 
and vice versa’; hence in particular ‘FG = A’ amounts to ‘No F are 
G@’, and ‘FG = A’ to ‘All Fare G@’. ‘FF = A’ and ‘F V F = V’ are 
valid, or true for all interpretations of ‘F’. Any Boolean equation 
has an equivalent with ‘V’ as one side, and another with ‘A’ as one 
side; for ‘F = A’ is equivalent to ‘F = V’, ‘fF = V’ to 7a 
and, in general, ‘F = G to ‘FG V FG = V’ and to‘FG V FG = 
i. 

This algebra was developed mainly by Boole, Jevons, and 
Schréder. Its formulas (valid and otherwise) comprise the Boolean 
equations and all truth functions of them. Valid examples include: 


GH) FG + A..GH =A. Dee 


and the like, answering to syllogisms in traditional logic. 
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What things ‘V’ is to be true of, and what things ‘F’ is to be true 
of, given an interpretation of ‘¥’, depend on our choice of “universe 
of discourse’ (DeMorgan). This choice may conveniently be varied 
from application to application; so formulas are counted valid only 
if true under all interpretations of ‘F’, ‘G’, etc., in all non-empty 
universes. The empty universe is profitably excepted because some 
formulas, for example ‘V # A’, fail for it which hold generally 
elsewhere. The question whether a formula also holds for the empty 
universe is easily settled, when desired, by a separate test; for all 
Boolean equations hold true for the empty universe, and accord- 
ingly any truth function of them can be tested by ‘‘evaluation’”’ (see 
above). 

Validity is of interest mainly as a key to implication, which is 
validity of the conditional. Now in simple cases implications are 
easily checked directly, by diagramming classes as overlapping 
circles (John Venn). To explore the consequences of given Boolean 
equations and inequalities we shade regions of the diagram which 
the equations declare empty, and flag regions which the inequalities 
declare occupied, and observe the effects. 

But the prime desideratum is a general decision procedure for this 
part of logic, such as the truth table provides for truth-function 
logic; i.e., a mechanical method of testing any truth function of 
Boolean equations for validity. The earliest such procedure was 
established by Leopold Léwenheim; others, easier, have appeared 
since. One is as follows. 

Let us suppose all equations (and inequalities) standardized so 
as to have ‘A’ as right-hand member, and no ‘A’ nor ‘VY’ in the left 
member. (Occurrences of ‘A’ and ‘V’ on the left can be got rid of by 
substituting ‘FF’ and ‘F V F’, or by a more efficient method which 
need not detain us.) Now any alternation of (one or more) such in- 
equalities, e.g.: 


(2) =G) AY GH SVS A, 


can be tested for validity as follows: delete ‘+ A’ everywhere, and 
test the result ((FG) V GH V FH’, in the case of (2)) for validity 
by truth table (as if ‘F’, ‘G’, etc., were sentence letters). 

The procedure can be extended to any alternation of formulas 
whereof one is an equation and the rest (if any) are inequalities, as 
follows: weaken the alternation by changing its one equation to an 
inequality and negating the left member thereof; then test the thus 
weakened alternation as in the preceding paragraph. For example, 
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‘FG = A.V.GH # A.V.FH # A’ (which is the syllogism (1), 
converted to normal form) weakens to (2). Such weakening is by no 
means an equivalence transformation; yet it does leave non-valid 
formulas non-valid and valid ones valid, as long as we adhere to 
alternations with only one affirmative equation. (This is not evi- 
dent, but can be proved.) 

The procedure can be extended in turn to any alternation of 
equations and/or inequalities; for an alternation which includes 
several affirmative equations is valid if and only if, by dropping all 
but a certain one of its affirmative equations, it can be got down to 
an alternation valid according to the preceding paragraph. (This 
again is not evident, but can be proved.) 

Finally we can test any truth function whatever of Boolean equa- 
tions, as follows. We put the whole into conjunctional normal form, 
handling its equations as one would sentence letters. The result is a 
conjunction of alternations as of the preceding paragraph; and the 
whole is valid if and only if each of those alternations is valid. 


8. Quantification. Conjunction, alternation, and negation have 
their analogues for polyadic predicates, as for monadic predicates. 
But there are also further operations, applicable to polyadic predi- 
cates, which have no analogues for sentences or monadic predicates. 
One is converse, which, applied to ‘greater than’ or ‘parent’, gives 
‘less than’ or ‘offspring’. Another is relative product, illustrated by 
‘friend of father of’. Another is mage, which combines dyadic and 
monadic predicates as in ‘friends of musicians’. The algebra of 
dyadic predicates or relations (DeMorgan, Peirce, Schréder), which 
treats of these matters, is far more complex and less intuitive than 
the algebra of monadic predicates or classes. On the other hand the 
logical inferences for which this extended algebra provides, and 
further ones for which it does not, prove easily manageable under a 
more analytical approach which departs from the pattern of an 
algebra of predicates; namely, the logic of quantification, founded 
by Frege. 

The universal quantifier ‘(x)’ may be read ‘Each object x is such 
that’, and the existential quantifier ‘(Ax)’ may be read ‘At least one 
object x is such that’. These are prefixed to open sentences such as 
‘z is red’ which, of themselves, are neither true nor false because of 
the free ‘a’. The resulting quantifications in this instance, ‘(x)(x is 
red)’ and ‘(Aaz)(x is red)’, are closed sentences and are respectively 
false and true. 
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Writing ‘Fz’ to indicate application of the predicate ‘F’ to ‘x’, we 
can express ‘All F are G’ and ‘Some F are G’ as ‘(x)(Fx D Ga)’ and 
‘(ax) (Fx . Gx)’, and ‘No F are G’ indifferently as ‘(x)(Fx D -@2)’ 
or ‘—(axr) (Fx . Gx)’. Typical purposes of Boolean equations can thus 
be served alternatively by quantification and truth functions. In- 
deed any Boolean equation can be translated into these terms; first 
we rephrase it with ‘V’ as its right side, and then we translate it as 
‘(z)(. . . )’ where ‘. . .’ is got from the left side by inserting ‘2’ 
after each predicate letter. 

But the great value of quantification theory resides in the fact 
that it is adequate equally to polyadic matters. Here we write predi- 
cate letters followed by multiple variables: ‘Fzy’, ‘Gayz’. The effects 
of the above notions of converse, relative product, and image are 
then got by ‘Fyz’, ‘(3y)(Fxy . Gyz)’, and ‘(ay) (Fry . Gy)’. 

The use of variables other than ‘xz’ in quantifiers changes the 
sense of the quantifiers in no way, but serves merely to preserve 
cross-references, as in ‘(3x)(y)F zy’; this is the quantification by 
‘(ax)’ of the open sentence ‘(y)F zy’. 

The valid formula ‘(az)(y)Fxy D (y)(ax)Fxy’ illustrates the ex- 
tended coverage which quantification theory affords as contrasted 
with the algebra of monadic predicates or, a fortiori, the traditional 
theory of the syllogism. 

The formulas of quantification theory are built up from atomic 
formulas ‘p’, ‘9’; ‘Fa’, ‘Fy’, ‘Gry’ , ‘Hyx’, ‘J xyz’, etc., by truth func- 
tions and quantification. The convention is that the atomic for- 
mulas represent any sentences, subject to these conditions: (1) If 
the atomic formula occurs under a quantifier whose variable it 
lacks, then the sentence which it represents must lack free occur- 
rences of that variable. (2) If two atomic formulas are alike except 
for variables (e.g., ‘Fx’ and ‘Fy’), then the represented sentences 
must be similarly related. 

Interpretation of a formula consists in choosing a universe as 
range of values of ‘x’, ‘y’, etc., choosing truth values for ‘p’, ‘q’, etc., 
choosing specific objects of the universe for any free variables, and 
deciding what objects (or pairs, etc.) the predicates ‘F’, ‘G’, etc., 
are to be true of. A formula is valid if true under all interpretations 
with non-empty universes. As before, the case of the empty uni- 
verse is easily handled by a separate test; for all universal quanti- 
fications are there true, and all existential ones false. 

Implication, as usual, is validity of the conditional. In particular 
‘Fyy’ implies ‘(ar)F xy’. This example of implication, and closely 
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similar ones, will now be given a special name. The existential 
quantification of any formula £, with respect to a variable a, will be 
called an existential consequence of any formula which is like & ex- 
cept for having free occurrences of some variable 8 wherever £ has 
free occurrences of a. 

By a surface occurrence of a formula ¢ in a formula y let us under- 
stand an occurrence which is overlaid by no quantifier or negation 
or ‘)’ or ‘=’; nothing but conjunction and alternation. Now it 
is readily seen that the following rule of inference leads from formu- 
las always to formulas which they imply. 


Rue I: Supplant a surface occurrence of ¢ in W by an existential 
consequence of o. 


Example: two steps by Rule I lead from ‘-Fwz V Fwz’ to ‘(3y) 
—FPwy V (ax) F x2’. 

The next rule of inference does not lead always to implied 
formulas, but it does lead from valid formulas to none but valid 
formulas. 


Rute II: Jf @ ts a variable whose free occurrences in W lie wholly 
within a surface occurrence of , then insert a universal quantifier 
so as to quantify ¢ with respect to a. In so doing you may also change 
a to a new letter B, provided that B is foreign to ¢. 


Example: two steps by Rule II lead from the result of the pre- 
ceding example to ‘(x)(ay)-Fay V (y)@€z)Fzy’. 

The next two rules obviously lead from formulas to equivalent 
formulas. 


Rute III: Supplant any clause at will by any which ts equivalent 
to it by truth tables. 


Rue IV: Change ‘(ax)-’, ‘(x)-’, ‘(Ay)’, etc., at will to ‘(a)’, 
‘-(3x)’, ‘—(y)’, etc., respectively. 


Example: two steps by Rule IV lead from the result of the pre- 
ceding example to ‘—(4r)(y)Fzy V (y)(ax)Fry’, or ‘€z)(y)Fxy D 
(y) (ax) Fry’. 

As the above chain of examples illustrates, Rules I-IV enable us, 
starting with a formula which is valid by truth table (“-Fwz V Fw’ 
in the example), to generate further valid formulas of quantifica- 
tion theory. This technique is in fact complete (Kurt Gédel, Jacques 
Herbrand), that is, capable of yielding any valid formula. 

It is convenient in practice, though strictly unnecessary, to 
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invoke supplementary rules, based, for example, on the implication 
of ‘Fy’ by ‘(x)F2’, or the equivalence of ‘(x)(Fx .Gz)’ to ‘(x) Fx. 
(aG@e or of ‘ar)(Fz V Gx)’ to ‘ar)Fa V @x)Gz’. 

Despite its completeness, the described proof procedure is not a 
decision procedure (see above). For, even if we fail to find the proof 
of a formula, we may not know whether the formula is nevertheless 
valid, its proof having merely eluded us. In truth-function logic 
and again in the algebra of monadic predicates we saw decision 
procedures; for quantification theory, however, none is possible 
(Alonzo Church). 


4. Identity. The sign ‘=’ was used in earlier pages to form true 
sentences from coextensive predicates. But it has its primary use 
rather between variables ‘x’, ‘y’, etc., as a dyadic predicate in its 
own right, expressing identity. The theory of identity can be 
summed up in the axiom ‘(x)(z = x)’ and the axiom schema 
‘(x)(y)(c = y. Fx.D. Fy)’, wherein ‘Fx’ and ‘Fy’ represent any 
sentences which are alike except that the one has free ‘x’ at some 
points where the other has free ‘y’. 

Gédel showed that every truth in the notation of identity 
theory is obtainable from this basis by quantification theory. For 
example, one case of the axiom schema is ‘(z)(y)(@ = y.z2 = 
z._).y = x)’, which, with ‘(z)(@ = x)’, yields ‘(x)(y)(a@ = y.D. 
y= 2x)’. 

Identity, added to the truth functions and quantifiers, enables 
us to deal with the idioms ‘only x’, ‘everything else’, and the like. 
From ‘Someone on the team admires everyone else on the team’ 
and ‘Some fielder on the team is admired by no one’ we can argue 
by means of quantification theory plus identity and its axioms, 
but not by means of quantification theory alone, that some fielder 
on the team admires everyone else on the team. 

The adding of identity provides also for a rudimentary treatment 
of number: we can express ‘There are exactly n objects x such that 
Fz’, symbolically ‘(3,x2)Fx’, for each fixed n. For, ‘(49z)Fx’ can be 
rendered ‘—(4r)Fx’, and ‘(3,7)Fx’ for each succeeding n can be 
rendered ‘(3xr)[Fx . (An1y) (Fy . x ¥ y)])’ (Frege). 

The adding of identity enables us also to introduce an operator 
of singular description ‘(1.x)’ (Frege, Giuseppe Peano) in such a way 
as to serve the useful purposes of the words ‘the object x such that’. 
The trick (essentially Bertrand Russell’s) is to explain ‘(.7)F2’ 
not outright, but in any atomic context. Thinking of ‘G(:x)F2’ as 
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any such context of ‘(17)Fx’, we explain ‘G(.x)Fz’ as short for 
‘(ay)[Fy . Gy . (x)(Fz D. x = y))’; that is, some F is G and nothing 
but it is F. This device enables us to handle singular terms within 
the framework of pure quantification theory with identity. Con- 
sider, for example, the function sign or operator ‘+’, which pro- 
duces compound singular terms of the form ‘y + 2’. Its purposes 
can be served by an ordinary triadic predicate ‘2’, where ‘Dxyz’ 
means that x is the sum of y and z; for, ‘y + z’ can then be taken as 
‘(ax) Dayz’, subject to the general contextual definition of ‘(:2) Fz’. 


5. Set Theory. In ‘=’ we have a logical predicate. There is an- 
other, the predicate ‘e’ of class membership, which is more powerful: 
so much so that it may with some justice be viewed rather as 
mathematical, in a sense exclusive of logic. Once ‘e’ is assumed, ‘=’ 
is dispensable; for ‘x = y’ can be paraphrased as ‘(z)(xez._. 
y € z)’. Much else also becomes expressible, for which ‘=’, quanti- 
fiers, and truth functions were inadequate. For example, we can 
now express ‘ancestor’ in terms of ‘parent’. For, one’s ancestors (if 
for simplicity we reckon oneself among them) comprise the com- 
mon members of all classes which contain oneself and all parents of 
members (Frege); and ‘xz is ancestor of y’ can accordingly be 
rendered ‘(z)[yez.(u)(w)(u is parent of w.wez._..uez)._. 
HreZ)’. 

The logic of ‘e’ is set theory, or class theory, in a more genuine 
sense than was the algebra of monadic predicates. In the latter 
there is no call to posit a realm of classes. ‘F’, ‘G’, etc., stand in 
place of predicates, but a predicate, like an open sentence, is 
simply an expression which is true of many or one or no objects. 
With the advent of ‘e’, on the other hand, classes are demanded as 
actual members of the universe of quantification; compare the ‘2’ 
of the last example. 

At this stage therefore there is a place for class names, as against 
mere predicates. Such names are formed by the notation of class 
abstraction: £F x is the class of those objects x such that Fx. Actually 
this notation can be introduced in terms of prior notations, as short 
for ‘(ay)(x) (x ey .= Fx)’. In particular ‘é(z = x)’ and ‘ta 4 ay 
are abbreviated ‘V’ and ‘A’; but this use of these signs as names of 
classes is not to be confused with our earlier use of them as predi- 
cates. Similarly the Boolean functions can be explained now for 
classes: ry as &(zex.zey), x Vy as &(zex.V.zey), and & as 
2-(z ex). The algebra of monadic predicates can of course be read 
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from the start with ‘PF’, ‘G’, etc., as class variables and ‘F = G@’ as 
ordinary class identity; but to do so is a gratuitous positing of 
entities before necessity. 

Relations are understood as classes of ordered pairs (or triples, 
etc.). Now the one thing demanded of a concept of ordered pair, 
z;y, is that 2;y = z;,w.D.x% = z.y =w. This is demonstrably 
fulfilled by any of the various artificial definitions of ‘x;y’ within the 
theory of classes (Norbert Wiener). For example, having defined 
{x} as 4(z = x) and {z, y} as {x} V {y}, we can take 2,y as {{z}, 
{x, y}}. Thereupon relational abstraction, ‘éjFxy’, becomes de- 
finable as ‘2(4x) (Sy) (2 = x;y . Fry)’. The theory of relations is thus 
obtained within set theory. Analogues, in particular, of the notions 
of the algebra of dyadic predicates are forthcoming: the converse of 
a relation z is £9(y,;z €z), the relative product of u into w is £9 (32) 
(a;zeu.z;yew), and the image of w by z is £(y)(a;yez. yew). 

Since the algebras of predicates cover less than quantification 
theory itself, their reproducibility in set theory is no motive for set 
theory. The gain in power afforded by ‘e’ and classes 1s seen rather 
in the ancestral construction (above), and again in connection with 
number, to which we now turn. 

In defining ‘(a,x)F2’ for each fixed n we made no provision for 
numbers as values of variables of quantification. In set theory, 
however, we can so provide, construing each number as the class of 
those classes having that number of members (Frege). Thus 0 is 
definable as {A}, and in general w+ 1, or Sw, is definable as 
g(x)(x@ey.y—{x} ew). Thereupon, following the plan of the defi- 
nition of ancestor, we can explain ‘Num 2’ (‘x is a number’) as short 
for ‘(z)[(Oez. (w)(wez.D.Swez). >. xez]’ (Frege). 

Now we have the means of saying, for example, not only that a 
class x has five members, (35z)(z ea) or xe 5, but also that x has 
just as many members as y; viz., (3z)(Numz.vez.yez). Wecan 
go farther: we can express the whole of number theory. For ex- 
ample, x + y is definable as comprising those classes z such that 
part of z belongs to x (i.e., has « members) and the rest to y; sym- 
bolically, 2(aw) (zw ex . zy € y). The product x-y is definable, more 
complexly, by exploiting the fact that if a class z has x mutually 
exclusive classes as members, and each of these has y members, 
then there are x-y members of members of z. 

Numbers, as thus far considered, are the sizes of finite classes, 
that is, the positive integers and 0. But Georg Cantor’s theory of 
infinite class sizes, or infinite numbers, can likewise be expressed 
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within our present terms; a main step towards it is the easily de- 
fined notion of a one-one relation. 

Finite numbers of other sorts than 0, 1, 2, . . . are manageable 
as well. Somewhat artificially we can identify ratios 14, 34, etc., 
with the pairs 1;2, 2;3, etc., and then redefine sum, product, and 
other relevant notions appropriately for application to ratios so 
construed. Irrational numbers prove to be satisfactorily identifiable 
with certain infinite classes of ratios (Richard Dedekind), and 
finally negative and imaginary numbers can be accommodated by 
further recourse to the device of ordered pair. Mathematical func- 
tions can be identified with certain relations, namely, the relations 
of values to arguments of the functions ordinarily so-called; for 
example, the function “square of,’”’ symbolically A(x), is definable 
as gi(y = x?). By such methods, set forth in detail by Frege, 
Peano, Alfred North Whitehead, Bertrand Russell, and others, set 
theory is shown to embrace classical mathematics 1n a very general 
sense of the term. The truth functions and quantifiers and ‘e’ thus 
emerge as, in theory, a comprehensive notation for mathematics. 

The question of suitable axioms for ‘e’ is, in effect, therefore, a 
question of suitable axioms for mathematics. To begin with there 
is the axiom of extensionality: (w)(wex.=.Wey).2E2Z._.Y €%. 
This is clearly wanted where x and y are classes; and it can be 
adopted without restriction if we identify each non-class x, arti- 
ficially but conveniently, with {x}. Now what are wanted by way 
of further axioms are those of class existence: axioms of the form 
‘€F x exists’, that is, ‘(ay)(z)(xey .= Fx)’. But actually we can dis- 
prove one such sentence, thus: ‘-[yey.=—(yey)]’ is valid by 
truth table, and from it by Rule I and Rule II we get ‘(y)(@z) 
-[cey .=—-(rex)]’, and so, by Rule IV, (ay) (a) [z ey .= -(wex))’ 
(Russell’s paradox). Infinitely many such counter-examples are 
known. Nor can we somehow banish just such refutable cases and 
keep ‘(Sy)(x) (a ¢ y .= Fx)’ for the rest; for various of the remain- 
ing cases, not refutable individually, are mutually inconsistent. 

Varied systematic proposals have been made, since 1903 (Frege), 
in an effort to encompass an 1n some sense optimum set of consist- 
ent cases of ‘(Jy)(x)(xey .= Fx)’. One system, Russell’s theory of 
types, stratifies the universe into individuals, classes of individuals, 
classes of such classes, and so on, and then, appropriating a dis- 
tinctive style of variable to each such type, rejects as meaningless 
any formula containing ‘e’ otherwise than between variables 
appropriate to consecutive ascending types. The cases of ‘(ay) (x) 
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(x ey .= Fx)’ which survive as meaningful are consistent. But 
the resulting theory is unwieldy in certain respects; moreover it 
proves to require supplementation with an axiom of the infinitude 
of individuals, if we are to preserve the law ‘4 +1=y+41.D. 
x = y’ of number theory. 

Ernst Zermelo’s method was to assume all cases of the form 
‘(Jy)(z) (wey .=.xez.Fx)’ (Aussonderungsaziom) plus a certain 
assortment of further cases by way of providing classes z for the 
Aussonderungsaxiom to operate on. This method has equally 
serious drawbacks. 

John von Neumann’s method was to declare some classes in- 
capable of being members. Thereupon ‘(dy)(x)(~ey .= Fx)’ can 
be assumed in general with ‘x’ restricted to elements, i.e., classes 
of the kind capable of membership. (He did not assume this much, 
but we may.) It remains, in such a theory, to institute conditions 
of elementhood. Von Neumann modeled his conditions of element- 
hood on Zermelo’s conditions of class existence; Willard V. Quine 
and Hao Wang modeled theirs rather on certain traits of the theory 
of types. 

Choice among such alternative foundations of set theory hinges 
on relative naturalness, elegance, convenience, power, and likeli- 
hood of consistency. Consistency proofs are not absolute, since 
they assume the consistency of the theory in which they are con- 
ducted; but a theory is occasionally bolstered by a consistency 
proof relative to a less suspect theory. 

The clear optimum in set theory is not at hand, and, in one 
sense, never can be: for Kurt Gédel has proved that no theory 
adequate to expressing so much as the elements of number theory 
can have a complete and consistent proof procedure. The com- 
pleteness noted in quantification theory, though attainable also 
in the elementary algebra of real numbers (Alfred Tarski), is 
possible neither in the elementary theory of whole numbers nor 
in various other parts of mathematics; nor, a fortiorz, in the foun- 
tainhead which is set theory. 

' 6. Further Aspects. The lack of a unique clear line in set theory 
has encouraged some (e.g., Hermann Wey] and at one point Rus- 
sell) to espouse constructionalism. The constructionalist in set 
theory stratifies the universe of classes into so-called orders, and 
assumes ‘(dy)(x)(x ey .= Fx)’ only when all quantified variables 
other than ‘y’ are limited to orders lower than that assigned to ‘y’. 
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(Henri Poincaré: “predicative definition.”’) This course is safer, 
more intuitive, and epistemologically more scrupulous than others, 
but it is inadequate to certain classical theorems in the theory of 
functions of real numbers—theorems which constructionalists are 
accordingly prepared to abandon. An extreme variant of construc- 
tionalism is intuztionism (Liutzen E. J. Brouwer, Arend Heyting), 
which even revises elementary logic, abandoning the law ‘p V 9’ 
of excluded middle. 

A different departure from truth-function logic is seen in modal 
logic (Clarence J. Lewis, Rudolf Carnap, Frederic B. Fitch), which 
admits, on a par with ‘and’, ‘or’, and ‘not’, the operator ‘neces- 
sarily’. Against the need for this departure it has been argued that 
the purposes thus served are better served by talking about 
formulas, attributing validity and tracing implications as hereto- 
fore in this article. 

The standard elementary logic is also frequently departed from 
in the purely exploratory spirit of abstract algebra—notably in 
many-valued logic (Jan Lukasiewicz, J. Barkley Rosser, Atwell R. 
Turquette), where the number of truth values is generalized from 
two to n. 

Apart from such substantive deviations, we can radically vary 
the mode of development of the standard sort of theory. For ex- 
ample, instead of taking membership and quantification as funda- 
mental, we may begin with identity, functional abstraction ‘A,’ 
(see above), and functional application. If by a harmless artifice 
we look upon sentences as names of their truth values, then 
identity and functional abstraction and application become a 
sufficient basis for defining membership and quantification (Frege, 
Church) and even the truth functions (Tarski). This alternative 
train of construction is less practical in some ways than that which 
starts with truth functions, quantification, and membership, but 
it brings added illumination. For functional abstraction can be 
shown eliminable, in turn, in favor of a few specific functions, 
called combinators (Moses Schonfinkel, Haskell B. Curry). We 
thus come out with a foundation for logic, including quantification, 
set theory, and their entire mathematical suite, which is devoid of 
variables. 

Proof theory is a domain drawn on but not described in the above 
survey. It includes the proofs of the completeness and undecida- 
bility of quantification theory (Gédel, Church), the incompleta- 
bility of elementary number theory (Gédel), the completeness of 
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elementary algebra (Tarski); also the Lowenheim-Skolem theorem 
that any consistent set of formulas of quantification theory is in- 
terpretable in the universe of whole numbers. 

A powerful device in proof theory is that of numbering all the 
marks and strings of marks available in the notation of a theory, 
so as to obtain numerical relations parallel to the inferential re- 
lations between sentences of the theory concerned. It was by thus 
applying number theory to the sentences of number theory, in 
particular, that Gédel contrived to prove incompletability. Such 
numbering also underlay the discovery of the number-theoretic 
concept of recursiveness, which makes precise sense of the idea of 
mechanical computability (Herbrand, Gédel, Church, Alan M. 
Turing, Post, Stephen C. Kleene). The exact definition of ‘‘de- 
cision procedure”’ turns on recursiveness, as do the exact statement 
and proof of Church’s theorem and Gédel’s incompletability 
theorem. Relevant also to the theory of machine computation, 
recursiveness is the worthy focus of a new branch of number theory. 


Itt 


«t A Method of Generating 
Part of Arithmetic Without 
Use of Intuitive Logic 


1. Introduction. By an identity I mean an equation which is a 
theorem of ordinary arithmetic (and hence true for all values of its 
variables). What is meant here by “ordinary arithmetic” may be 
left to the reader, granted the following three reservations: 


(a) The equations 
(A) 2 2 — a), 
(B) o— yy —&) St Ge) 


are identities, that is, theorems of ordinary arithmetic. 


(b) Inference according to either of the following rules is valid in 
ordinary arithmetic: 


(R) Substitute any one expression for all occurrences of any variable. 
(R’) Given a = B, put a for B anywhere. 


(c) An equation whose members open with the same variable and 
contain only variables connected by subtraction, without paren- 
theses, is not an identity unless each variable occurs exactly as 
many times in one member as in the other. 


Copyright American Mathematical Society 1934 from the Bulletin of the 
American Mathematical Society, Vol. 40, No. 10, pp. 753-761. 

1 For suggestions in the preparation of this paper I am indebted to E. V. 
Huntington and T. P. Palmer. 
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A homogeneous linear identity with rational coefficients is an iden- 
tity whose members contain only plus and minus signs, paren- 
theses, ‘0’, variables, and rational numerical coefficients. Aside 
from the prefixture of such coefficients, no multiplicative juxtaposi- 
tion is admissible. The expressions thus admitted can all be defined 
in terms of variables, parentheses, and the signs of subtraction and 
equality. These definitions, which have the status of mere conven- 
tions of notational abbreviation, will be set forth in §3. 

It ts the business of this paper to show that all homogeneous linear 
adentities with rational coefficients, when thus analyzed into subtrac- 
tion, equality, and variables, can be generated from (A) and (B) by 
means solely of mechanical substitution according to (R) and (R’). 


2. Generation of Theorems. (A) and (B) comprise the postulates, 
and (R) and (R’) the rules of inference, of a calculus whose primi- 
tive language contains only variables, the functor of subtraction, 
and the equality sign.? The parentheses represent nothing in the 
way of an additional primitive idea, but merely constitute a part of 
the notation of the functor of subtraction. This functor is the nota- 
tional scheme of writing expressions in the respective blanks of the 
matrix ‘ — ( )’. No parentheses are needed enclosing 
the blank to the left of ‘—’ in this matrix, for, as is easily verified 
relatively to subtraction or any other binary operation, the condi- 
tion that all right-hand operands and only right-hands operands 
are to be parenthesized suffices in any context for the unique deter- 
mination of the grouping of left-hand operands as well. 

Strictly, (A) and (B) should appear as 


z= x— (y— (y)), = Gey a— & =e): 


The suppression of parentheses enclosing single letters is an ellipsis; 
in using (R) and (R’), we are to imagine that single letters immedi- 


2'The primitive dyadic relation of equality might be suppressed in favor of 
a primitive predicate; if namely ‘3’ denote the predicate of equality with 0, 
so that ‘8’ is read ‘t vanishes’, we can render any equation a = Bas Ba — B. 
(A) and (B) become 


Bx-(¢-—(y-y)) and B (@-(y-z))-@-(y-2)). 


(R) remains unchanged, but (R’) refers now to 3 a — @ instead of a = 8. 

Indeed, since in the thus modified calculus all theorems and postulates begin 
uniformly with ‘¥’, that letter might be dropped entirely; the list of theorems 
and postulates would then appear as a list of alternative expressions of 0. A 
system is thus arrived at whose theorems and postulates are expressive of 
elements of the system itself, as is the case in the logical calculus of proposi- 
tions. 
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ately to the right of minus signs are enclosed in parentheses, and 
we are to make such parentheses explicit when a complex is sub- 
stituted for such a letter. 

Let us proceed to the generation of theorems from (A) and (B) 
by (R) and (R’). The manner of generation is indicated, to the 
right of each theorem, by literal or numerical reference to the 
postulates or intermediate theorems from which the theorem in 
question is derived. A single reference ‘t’? means that the generation 
proceeds by substitution in the theorem or postulate (¢) according 
to (R). A double reference ‘(s), (t)’ means that the generation 
proceeds by replacement of an occurrence of the right member of 
(s), within (t), by the left member of (s), on the authority of (R’). 


(1) <= k, (A), (A) 
(2) xr=x-— (z-2), (A) 
(3) pe Ay (B) 
(4) g—(¢ —&— 2) 2 ee (B) 
Oe @- @-— 2 @ 4) = 2S9—- ee) (B) 
(6) Y-y- © - Bose a eae (1) 
(7) pS ae ee) (2), (4) 
(8) t—2-yow—w— yy — 2-2 
(9) t—y-2=w—w—-@-C@—- 7) 
(10) 2-Yy-—a2)=2-y- @—2—-2)) eee 
(11) 2G Sy Ss ee ee (10) 
(12) t-y-z=w-—w-—Yy-—-@-2y7ere 
(13) z—(y—z)=z-y—-@—z—2), Bee 
(14) t— Yo) = fy yaa (13) 
(15) Z-y-2=x-2-y, (8), (12) 
(16) z2-z2—-(@—y)=2z2-(¢#-y) -2, (15) 
(7) RS 2, (7), (16) 
(18) y= oS ey — (3), 7) 


(19) s=2-y-y-y-y)) eee 


GENERATING PART OF ARITHMETIC 55 
(20) my — YY — (ae er), (4), 
ee oy —y—- (fe —x—- x)= 2. (20), (6) 


3. Phrases. By a phrase I shall mean any expression built up of 
variables and the functor of subtraction. A more rigorous descrip- 
tion is the following: Letters, also the expression ‘z — y’, are 
phrases, and if a letter in a phrase be replaced by a phrase (with 
restoration of requisite parentheses, in conformity with §2) the 
result is a phrase. A phrase equation is an equation whose members 
are phrases; a phrase identity is a phrase equation which is an 
identity. 

A homogeneous linear identity with rational coefficients is not 
necessarily a phrase identity, for, over and above expressions of 
subtraction, its members may involve expressions of algebraic 
negation or addition, the sign ‘0’, and rational numerical coeffi- 
cients. These further devices are all definable, however, as means 
of abbreviating phrases or phrase equations. ‘0’ can be explained 
as an abbreviation for the specific expression ‘x — x’. Negation 
can be accounted for by defining —z in general as 0 — z, that is, by 
construing ‘— ( )’ as an abbreviation for ‘0 — ( )’. Ad- 
dition can be introduced similarly by defining x +- y asx — (—y), 
and the integral numerical coefficients are provided for by defining 
2x asz-+ 2, 3x as 2x + x, and so on.’ The symbols ‘Oz’ and ‘12’ 
may be taken as notational variants of ‘0’ and ‘z’. 

Fractional numerical coefficients, finally, can be defined con- 
textually by the following abbreviative conventions. An equation 


m 
ee eee ee 


with any disposition of parentheses, is an abbreviation for 
Rane — ... —wnt— ... —mMP— me; 


with the corresponding disposition of parentheses, where m is 
any numeral and n any numeral other than ‘0’. The corresponding 
convention is adopted where the fraction occurs to the right of the 
equality sign. If several fractional coefficients occur in an equation, 
their explanation by the above conventions is to proceed in a left- 
to-right order. This set of conventions provides for the occurrence 
of fractional coefficients not only in subtraction but in the other 


3 After Huntington, 15. 
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contexts as well, when the definitions of the preceding paragraph 
are brought into play.‘ 

Every homogeneous linear identity with rational coefficients is 
either a phrase identity or an abbreviation of a phrase identity by 
the above abbreviative conventions. The proof that every homo- 
geneous linear identity with rational coefficients is, upon elimina- 
tion of the above abbreviations, generable by (R) and (R’) from 
(A) and (B), will therefore consist in showing that every phrase 
identity is generable by (R) and (R’) from (A) and (B). 


4. Canonical Form of an Equation. An equation will be said to be 
of canonical form if each of its members is a phrase in which no 
parentheses occur and in which like letters appear consecutively, 
and if the initial letters of the two members are alike and the order 
of succession of unlike letters is, barring omissions, the same in both 
members. The canonical form is thus 


(i) Oieon Ul Up mm ee ee Se a ee 
SY TU Tm Um wl lw See 


where v; and »; are unlike letters unless 7 = 7, and where, for any 
m, the number of occurrences of ‘—»v,,’ in either member may be 0. 

In saying that an equation £ is derivable from an equation F, 
I shall mean that E is generable by (R) and (R’) from (A), (B), 
and F. In saying that E is interderivable with F, I shall mean that 
E is derivable from F and F from E. In saying that an equation is 
reducible to canonical form, I shall mean that it is interderivable 
with an equation of canonical form. 

Since the initial letters of the members of an equation of canon- 
ical form are alike, §1(c) tells us that an equation (i) of canonical 
form is not an identity unless each variable v; occurs the same num- 
ber of times in one member as in the other. But under such circum- 
stances the members of (i) are exact duplicates of each other, so 
that the equation is generable by (R) from the principle (1) of 
self-equality. Since (1) is one of the theorems already generated by 
(R) and (R’) from (A) and (B), it follows that every identity of 
canonical form is generable by (R) and (R’) from (A) and (B). 

4It is seen from (A) and (B) that the structure of the calculus developed in 
§2 admits of another application, where the elements are non-vanishing num- 
bers and ‘z — 7’ is reinterpreted as meaning ‘r divided by y’. Under this re- 
interpretation, 0 as above defined gives way to 1, the negative gives way to 


the reciprocal, the sum to the product, and the numerical coefficients to the 
corresponding numerical exponents. 
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By §1(b), any equation generable by (R) and (R’) from theorems 
of ordinary arithmetic is a theorem of ordinary arithmetic, and 
hence an identity. In view of §1(a), therefore, any equation gener- 
able by (R) and (R’) from (A), (B), and F is an identity if EF is an 
identity. Now let £ be an identity reducible to canonical form, that 
is, interderivable with some equation F of canonical form. Equa- 
tion F is then an identity, since it is generable by (R) and (R’) 
from (A), (B), and #. But, being an identity of canonical form, 
F is generable by (R) and (R’) from (A) and (B) alone. Hence £, 
derivable as it is from F, is likewise generable by (R) and (R’) 
from (A) and (B). It is thus established that every identity E 
which is reducible to canonical form is generable by (R) and (R’) 
from (A) and (B). 

In order to show then that every phrase identity is generable by 
(R) and (R’) from (A) and (B), it will be sufficient to prove that all 
phrase equations (and hence in particular all phrase identities) are 
reducible to canonical form. This proof occupies the remaining 
section. 


5. Proof of Reducibility. Given any equation 


(ii) a= £, 

the reverse equation (iv) is derivable as follows: 

(iii) Dae (1) 
(iv) DS oy (ii), (iii) 


Hence the following principle is established. 
I. Any equation a = B ts interderivable with its reverse B = a. 


By the terms of a phrase a I mean all letters of a not lying within 
parentheses in a, and all parenthesized phrases in a not lying 
within any broader parenthesis in a. Since parentheses can open 
only to the right of a minus sign [see §2], the initial term of a phrase 
is necessarily a letter. 

From any phrase equation 


(Vv) @—-a—-a- ... —a— Oy... Hon =, 
we can derive an equation (vii) as follows: 


(vi) Sais 8 we TREY — a 
See ae OS™ pea aa ae Sa, (15) 
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(vil) 2—-am—. 1. — ay — Oa 
— @; — Oye 2... Qe i: (vi), (v) 


By the same process, with a; and a,4: interchanged, we can derive 
(v) from (vii). Thus any phrase equation (v) is interderivable with 
an equation (vii) resulting from the permutation of any non-initial 
term of the left member of (v) with its successor. Serial application 
of this principle establishes the interderivability of (v) with the 
equation resulting from any permutation of non-initial terms in the 
left member of (v). In view of I, furthermore, the same will be true 
relatively to the right member. Hence the following principle is 
inferred. 


II. Any phrase equation E ts interderivable with any equation 
resulting from permutation of non-tnitial terms within either mem- 
ber of E. 


From any phrase equation 

(villi) y—-a—a—...—aQn=2— Pi — Po—... — Bn, 

we can derive an equation (ix) as follows: 

(ix) y-a— ... — Om 
=y-y-@—2z-2z)-fi-—... =. Gee 

Conversely, (viii) is derivable from (ix) as follows: 

[viii] Y—-m—... — em =eU—Pi—... — Ba (20), Gx) 

Therefore (viii) is interderivable with (ix). Hence the following 

principle is established. 


III. Any phrase equation is interderivable with a phrase equation 
the «nztral letters of whose members are alike. 


Since parentheses can occur in a phrase only by way of enclosing 
& non-initial term thereof, it follows from II that if any parenthe- 
sized expression in a phrase equation be transported to the end of 
the member in which it occurs, the resulting equation will be inter- 
derivable with the original one. Thus, in particular, any phrase 
equation 


(x) I—-a—-a—.. 


— On 


=@-—fi-f-... =81-¢ =e) ee 


whose members have like initial letters and whose right member 
contains parentheses as indicated, is interderivable with 
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ye —Yay— . .. — Om 

=a—B-—...-8ia—-Bea—... —~&—- C — 9). 

Now from (xi) we can derive an equation (xiii) as follows: 

ae ae fy — ... — Pear — Pa em oe BT 

Se y= . cot Biel Bice — “ee 
2¢-9-4 (18) 

See ey — «2. — Bam Ba Gs BK 
=Z—a—...—AQm—6. (xi), (xii) 


Conversely, (xi) is derivable from (xii) as follows: 


Gav) ©T—@— ... —OmEX—- am ... mo Om 
~9—(8-6—8), (19) 
(xv) E-a— 1... — amp KHer-p—... — bia Bin 
—...—-BP—-f— (6-0-9), (xiii), (xiv) 
(xvl) t—-Pi-—... ~Ba— bia sed ea 
ee So. . . el — a SD 
[xi] Z-a—- . 1. mam KH OK He c ~ — By 
oe s 8 6 =) = es (xvi), (xv) 


Thus (xi), and hence also (x), is interderivable with (xiii). By I, 
then, the reverse of (x) is likewise interderivable with (xiii). 
But in the reverse of (x), (¢ — 6) occurs in the left member rather 
than the right. Hence, in conclusion, a phrase equation whose 
members have initial letters and either of whose members con- 
tains a parenthesized expression is interderivable with a phrase 
equation (xiii) in which the members still have like initial letters 
but in which the parenthesized expression is broken up. Serial 
application of this principle establishes the following one. 


IV. Any phrase equation whose members have like initial letters 
is interderivable with a phrase equation whose members are free of 
parentheses and have like initial letters. 


From the definition of the ‘‘terms”’ of a phrase, it is clear that 
if a phrase is free of parentheses all its terms are letters. Hence, 
by II, any phrase equation H whose members are free of paren- 
theses is interderivable with any phrase equation resulting from 
permutation of non-initial letters within members of EF. Therefore 
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any phrase equation whose members are free of parentheses and 
have like initial letters is interderivable with an equation of 
canonical form. It then follows, by III and IV, that every phrase 
equation is reducible to canonical form. 

In view of §§3-4, this concludes the proof that upon elimination 
of abbreviations all homogeneous linear identities with rational 
coefficients are generable by (R) and (R’) from (A) and (B). 


th’ 


«t Definition of Substitution 


1. Basis. The elements of this study, denoted by italic capitals, 
comprise atoms, at least two and perhaps infinite in number, and all 
finite sequences of such atoms. (The atoms are interpretable as signs, 
for example, and the sequences as rows of signs.) Thus each element 
E is composed successively of possibly duplicative atoms Au, 
Ao, ..., and A,, for some positive integer m, called the length 
of EZ; and an element F composed successively of atoms B, to B, 
will be identical with E if and only if m = n and A; = B; for each 
2 to m. 

Further terminology is self-explanatory. Thus we may speak of 
the kth place of an element; this, in the case of H above, is occupied 
by the atom A;. We may speak of one element as occurring in 
another (as a connected segment thereof), and more particularly 
as occurring 2nitially, internally, or terminally therein; of two ele- 
ments as occurring overlapped in a third; of the number of occur- 
rences of one element in another; and so on. 

Juxtaposition will be used to express that binary operation of 
concatenation whereby any elements EZ and F’, composed as above, 
are put end to end to form that element EF which is composed suc- 
cessively of the atoms A;, As, ..., Am, Bi, Bo, ..., and By. 
The element £ is itself describable, in terms of concatenation of 
its atoms, as AiA2 . . . Am; parentheses are suppressed, as here, 
In view of the obvious associativity of concatenation. 

It is clear that the length of EF, for any EF and F, exceeds the 
lengths of E and F and equals their sum; also that E occurs 
initially and F terminally in EF, while EF occurs neither in E 


Copyright American Mathematical Society 1936 from the Bulletin of the 
American Mathematical Society, Vol. 42, No. 8, 561-569. Footnote 1 is newly 
added. 
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nor in F; also that atoms are length 1, and that an element G 
is an atom if and only if there are no elements E and F such 
that G = EF. 


2. Substitution. The purpose of this paper is a formal defini- 
tion of substitution in terms exclusively of concatenation and the 
following elementary logical devices: identity, applied to elements; 
the truth functions; and quantification with respect to elements.! 
The notation will be as in Principia Mathematica: the sign ‘=’ 
for identity, the signs ‘~’, ‘’, ... , for the truth functions, 
and prefixes of the forms ‘(X)’ and ‘(3X)’ for quantification. 

The proposition to be formulated is expressible verbally thus: 
W is the result of substituting X for Y throughout Z; briefly, 
sub(W, X, Y, Z). When the elements are interpreted as signs and 
rows of signs, the notion under consideration is the notational 
substitution which figures so prominently in metamathematics. 

In the general form in which substitution is here conceived, 
its formulation is complicated by the fact that the element Y for 
which substitution is made need not be an atom and hence may 
have overlapping occurrences in Z. Obviously we cannot in general 
replace each of two overlapping occurrences of Y by X, since re- 
placement of one occurrence will mutilate the other occurrence. 
To this extent the notion of substitution is ambiguous. The am- 
biguity is resolved by stipulating that in case of overlapping occur- 
rences left is to prevail over right; thus the result of substituting 
X for TT in TTT is to be XT rather than TX. The result of sub- 
stituting X for Y throughout Z is then describable, in general, as 
the result Z’ of putting X for each of these occurrences of Y in 
Z: the first (leftmost); the first which begins after the end of that 
first; the first which begins after the end of this second; and so on. 
In the trivial case where Y does not occur in Z, Z’ is of course Z. 


8. Formal Definitions. The following abbreviations are adopted: 
Di. UinitV =a: U = V.V: (aT). UT = V. 


1 Gédel showed (1931, 184, 192) that substitution was definable in elemen- 
tary number theory; and we shall see in the next paper, below, that concatena- 
tion theory eked out with names of the atoms amounts to elementary number 
theory. Still the present construction retains some distinctive traits: (1) It 
uses an apparatus presumably weaker than elementary number theory, 
namely, concatenation without names of atoms. (2) The elements substituted 
for are of any length; Gédel’s are atoms. (3) The presentation is more direct, 
not being involved with other developments. 
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D2. Win V oar? Vanit VV Gl). TU mevy. 
D4. @(U, V,M,N).=a. MUMVM inN .M ~in UV. 


D5. subl(U, 0) Y, VY) = df: 
U=X.V=Y.V:(8T).U =TX.V=TY :~(as). YSinV. 


Obviously ‘U init V’ may be read ‘U occurs initially in V’, and 
‘U in V’ may be read ‘U occurs in V’. Again, ‘subl(U, X, Y, V)’ 
tells us that Y occurs in V terminally and only so and that U is 
the result of putting X for Y in V; this is seen as follows. For Y to 
occur terminally in V it is necessary and sufficient that either 
V =Y or Q7).V =TY. Where V = YF it is clear further that 
Y does not occur in V otherwise than terminally; where (47) .V = 
TY, on the other hand, in order that Y not occur in V otherwise 
than terminally it is obviously necessary and sufficient to add that 
~(aS) . YS in V. In general, therefore, for Y to occur in V termi- 
nally and only so it is necessary and sufficient that 


V=Y.V:(7).V =TY:~ @S). YSin J. 
Now if U is the result of putting X for Y in V, U will be X or TX 
according as V is Y or TY. D5 thus yields the described meaning. 
The definition of substitution follows: 
Do eubiW, xX, Y,Z) ais. Y¥ ~inZ.W = ZVI. 
(9G) (GH) ::: Q)W) =. (UY) s:subl(U, X,Y, V) .D 
Vem Z.).O(U, V, M,N) :. (S)(L) : OC, T, M, N) 
TV init Z .D.O(SU, TV, M, N) ::3.0(G, H, M, N) 
ssW=G.Z=H.V:€AK).Y~ink.W=GK 
of = AK. 


When abbreviations introduced by D1-D5 are eliminated in favor 
of their definientia, the definiens in D6 is seen to involve only con- 
catenation and the elementary logical devices mentioned in §2. 


4. Demonstrandum. It remains to show that D6 yields substitu- 
tion in the sense of §2; that is, that sub(W, X, Y, Z), in the sense 
of D6, if and only if W is Z’ as of §2. 
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Supposing X, Y, and Z given as constants, we define as follows: 
Dtl. O@7, N) .=ac:: (U)(V) :: subl(U, X,Y, V) .D:. 
ViwmiZ .». OU, V, M,N): (S)(T) : OG, T, WM 
- FV int Z.D.O(SU, TV, M, N). 
Dt2. wW(F, G, H) .=ar:. (MW)(N) :&(M, N) .D. OG, H, M, N) 
».FP=G4G.Z=H.V:AK).¥Y~ink.F =GK 
2 = HK. 


By §1, (U)(V).XUXVX ~ in X; by D4, then, (U)(V). ~O 
(U, V, X, X), so that 


Ci MGV) :. Vineet Z.D. OU, Vi A): 
(2) (G)\(A) :. ®(X, X) .D.O(G, H, X, X): 
and, trivially, 


3) COMVY(S)(T) : OS, T, X, X) . PV init Z 
i >. O(SU, TV, X, X). 


By Dtl, (1), and (38), 
@(X, X).=:(U)(V) :subl(U, X, Y, V) .D.~. V init Z, 


whence ~®(X, X).D.(U)(aV). subl(U, X, Y, V).V init Z, 
and therefore, by D5, 


~6(X, X).D:. (AV) 2 V = Y.V:@T).V = TY: V init Z, 


that is, ~®(X, X).D: VY inttZ.V: (aT). TY init Z, which is to 
say, by D2, 


(4) ~@(X, X).D. YinZ. 


By Dt2, (@)(A) :. ¥(W, G, H) .D: &(X, X) .D. O(G, H, X, X), 
whence, by (2) and (4), 


(G)(E) : ¥(W, G, H).D. YinZ, 
that is, (4G) (4H) . ¥(W, G, H) .D. Y tn Z, or, equivalently, 
(5) (@G)@€H).v(W, G, Z).=:YinZ:(€G)G) .v(W, G, A). 
If Y nes not occur in Z, then, by §2, Z’ is Z. Hence 
(6) YrwinZ. W=2Z.=.¥Yr~mZ.wW=lZ'. 


.~. Vine, 
.~0(X, X), 
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By D6, Dt1, and Dt2, 
sub(W, X, Y, Z). 


>YrwinZ.W=Z.V:(aG@A). 
w(W, G, H). 

Hence, by (5) and (6), 

7 sub(W,X,Y,Z).=:YrwinZ.W=2'.V: Ying 

(7) : (4G)(aH) . V(W, G, H). 


Now if we can prove that 
I) YiunZ.D:A@Q@Q@AA).Vv(W,G, zZ).=.W=2', 


so that Yin Z : (AG)(AH) . V(W, G, H) :=.YinZ.W = 2’, then 
from (7) we shall have 


sub(W, X, Y,Z).=.YrwnZ.wWw=Z7’.V.YinZ.W=2Z, 

that is, sub(W, X, Y, Z).=.W = Z’ which was to be proved. 
It thus remains only to establish (1). 

&. Proof of (1). Given that Y occurs in Z, it is to be proved that 


(4G) (aH) . V(W,G, H).=.W = 2". 


Consider the following segments Z; of Z. Z, extends from the 
beginning of Z to the end of the first occurrence of Y in Z; Z_ ex- 
tends from the beginning of Z! to the end of the first occurrence 
of Y in Z!, where Z' is Z deprived of its initial segment Z,; and, 
in general, 7:1; extends from the beginning of Z‘* to the end of 
the first occurrence of Y in Z*, where Z‘ is Z deprived of its initial 
segment Z,Z, . . . Z;. By construction, Y occurs in each Z; termi- 
nally and only so; in view of §3, then, where the Z’; are the results 
of putting X for Y in the respective Z,, 


(8) (7) . subl(Z’;, X, Y, Z;). 


Let ‘Zio ee Zi and WIRING: ee Vi be written ‘230 and "ZL l . 
Thus 


(9) AL= 41.44! = 2%, 

(10) (@) . Zina! = Zi!Zear. Zora! = DZ sa, 
and, by construction, 

(11) (¢) . Z;! init Z. 


We exhaust the segments Z; only when we reach a point in Z 
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beyond which Y occurs no more. Thus, where Z, is the last of 
the Z 4; 


(12) (K):2=2Z2,!K .D.Y ~ink. 


Quantification with respect to subscripts, as in (8)— (11), refers 
of course only to the n or fewer significant values; for example, 
‘() . 0(Z;, Z's)’, ‘(A1) . 6(Z;, Z's)’, and ‘(¢). (Zi, Zi41)’ are short 
for 


“(Z1, 211) « $(Z2, 22)... (Sn, Zn)’, 

‘o(Z1, 2) V o(Z2, 22) V . « « OS, Z'n)’, 
and 

(Zi, Ze) « 6(Ze2, Zs). . » « P(Zn-1, Zn)’. 


By §1, there are distinct atoms; let A and B be any two such, 
and, where k is the length of Z’,!Z,!, let C= ABB...Btok 
occurrences of B. Clearly 


(13) (i). C~inZ' IZ), 


since the length of C is greater by one even than that of Z',!Z,,!. 
Now it will be shown that, for any elements G and H such that 
C ~ in GH, there are no occurrences of C in CGCHC except the 
three indicated ones. First, no two occurrences of C can overlap; 
for, if they are distinct, one must start later than the other and 
hence must start at a non-initial place of the other; but all these 
non-initial places are occupied by B, whereas C’ starts with A. 
Hence if there is an occurrence of C in CGCHC other than the 
three indicated ones, it overlaps none of the latter; it therefore 
lies wholly within G or H. But it cannot, since, by hypothesis, 
C ~in GH. Consequently there are none but the three occur- 
rences of C in CGCHC. In particular, then, it follows from (13) 
that there are none but the three occurrences of C in CZ’,!CZ;!C. 
Since, where 


(14) D = C4NICZNCCZ"e'CZe1C 2. . CZ’ nICZ AIC, 


D is made up wholly of segments of the form CZ;!CZ;!C, any 
occurrence of Cin D must lie either wholly within or partly within 
and partly beyond such a segment. But in the latter case the 
occurrence of C in question would overlap the terminal occur- 
rence of C in CZ’;!CZ,!C, whereas we saw that no two occurrences 


MCC — a a  e _ = 
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of C could overlap. Hence every occurrence of C in D lies wholly 
within CZ’;!CZ;'!C for some 7. But CZ’;!(Z;!C was seen to contain 
only the three indicated occurrences of C. Therefore D contains 
only the 3n occurrences of C indicated in (14). 

Where C ~ in GH, CGCHC contains, as was seen, only the three 
indicated occurrences of C, and is hence describable as beginning 
and ending with C and containing just three occurrences of C and 
no two adjacent. But, since D contains none but its 3n indicated 
occurrences of C, inspection of (14) shows that the only segments 
of D fulfilling this description of CGCHC are the segments CZ’;! 
CZ;!C for the various 7; any other segment of D beginning and 
ending with C and containing three occurrences of C would con- 
tain two adjacent. Hence, if C ~ in GH and CGCHC in D, CGCHC 
must be CZ’;'!'CZ;!C for some 7; then G and H must be Z’;! and 
Z;\. Thus, in view of D4, 


(15) (G)(H) : O(G, H, C, D) .D. At). G = 2';!. 0 = Z;!. 
Conversely, by (14), (13), and D4, 

(16) (i). O(2’,!, Z;!, C, D). 

By (15) and (16), 

(17)? (@)(A) : O(G, H, C, D) .=. (4) .G = 2',!.H = Z;1. 


If sub1(U, X, Y, V), then, by §3, Y occurs in V terminally and 
only so and U is the result of putting X for Y in V. But, if V con- 
tains Y just thus and if further Z;!V init Z, then V must be Z;41, 
so that U becomes Z’;4:; and where U and V are Z’;4; and Z;4: it 
follows from (10) and (16) that 0(Z2’,!U, Z;!V, C, D). Thus 

2'This exemplifies a general technique, within a concatenation system, for 
eliminating reference to a finite class or relation of elements in favor of refer- 
ence to two properly selected elements C and D. Where a is the m-adic relation 
(or class, if m = 1) exhibited by just the elements Q:1, Qiz,. . . , and Qim in 
that order, for the various 7 from 1 to n, and k is the length of the element 


QuQi2 . . . QimQaQee . . . Qam . . » QniQne . . » Quam, 
and Cis ABB .. . B to k occurrences of B, and D is 
CancOa,. .. COmCCQniCOn ... CQmC.«.-. CQmCQn . . . CQnnl, 
it can be shown that 
CG,CG, ... CGnC in D.C ~inGiG. . . . Gm 


if and only if Gi, G,, . . . , and G,», exhibit in that order the relation a. In (17) 
this equivalence is proved for the special case where m = 2; G, H, Z’; !, and Z;! 
answer to Gi, G2, Qi, and Qy2. 
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(U)(V) (2) :sub1(U, X, Y, V).2:!V intt Z .D.0(2'1U, Z,!V, C, D), 
and hence 


(U)(V) :. sub1(U, X, Y, V) .D:(S8)(T) : (2). 
S=Z2'!.T =Z,!.TV init Z.D. O(SU, TV, C, D); 
that is, by (17), 
(18) (U)(V) :. subl(U, X, Y, V) .D: (S)(T) : 
O(S, T, C, D). TV tat Z .D. OSU, TV, Crm: 


Again, if Y occursin V terminally and only so, and V init Z, then 
V must be Z:; thus, where sub1(U, X, Y, V) and V init Z, U and 
V will be Z’; and Z;. But, by (9) and (16), O(2'1, Z1, C, D). Thus 


(U)(V) :subl(U, X, Y, V). V init Z .D. O(U, V, C, D). 
From this and (18) it follows, by Dtl, that ®(C, D). By Dtz2, then, 


(G)(AZ) :: VW, G, H).D: OG, 7, C,D).W=G4.zZ2=H.V: 
(AK).Y~inK.W=GK.Z = dHkK, 


that is, by (17), 


(GG) : 8@,G, A) .D: G).G=7 ! =A 
W=G.Z=H.\V:(aK).Y~inKkK.W=GK.Z=4Hk, 


and hence 


(4G) (aH). U(W, G, H).D:() WH 7.2 = ZA.V: 
(AK). YwinK.W=2'1K.Z = ZK. 


But, whether Z = Z;!or (AK). Y ~inK.Z = Z;,!K, 7 must ben: 
for in neither case does Z contain any occurrence of Y after Z;!. 
Thus 


(19) (AG) (AH) : v(W, G, H) >.W = ey TEN) 
(QK).W = ZK .Z = Z,1K, 


Of the occurrences of Y in Z, the one in Z; is, by construction, 
the first, the one in Zz is the first which begins after the end of that 
first, and so on. Therefore if each Z; is supplanted in Z by Z’;, so 
that Z,,! is supplanted by Z’,!, the result will be Z’ as described in 
§2. Z’ is thus Z’,! or Z’,!K according as Z is Z,! or Z,!K. By (19), 
then, 


(20) (3G) (aH) . V(W, G, H) .D.W = 2". 


. lle —e” ee ———————————E 
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If $(M, N), then, by Dtl, 
subl(Z",, Xx, i ee Z1) ) 2 init Z Sie O(2'1, “Zi, M, N) 
and 


(4) 2. subl(Z' sas, X, Y, Zias) D1 O(Z'sl, Zit, M, N). 
ZLi!Ziar anit Z ae O(2'3!Z' s43, “Zi'!Zi41, M, N). 


But these two results reduce, in view of (8)—-(11), to 
O(2'1!, Z1!, I, N) 
and 
(@) :O(2’,!, Z:!, MW, N) .D. O's4a!, Ziai!, WU, N), 
and from these it follows that O(Z’,!, Zn!, M, N). Thus 
(21) (M)(N) : (M1, N) .D. O(2'n!, Zn!, M, N). 


By (11) and D1, Z is Z,! or else Z,!K for some K; and, as seen 
above, Z’ is in these respective cases Z’,! and Z’,!K. Moreover, in 
view of (12), Y ~ in K. Thus 


Z=Z',!.2=Z2,!.V:€K).Y~ink. 
Z=Z2', AK .Z = Z,3K. 


From this and (21) it follows, by Dt2, that (Z’, Z’,!, Zn!). 
Hence 


W = Z'.D. (AG)GH) . VW, G, BH), 
and consequently, by (20), 
GG)GH).¥0V,G, 2) .=W=7’, 


which was to be proved. 


« Concatenation as a Basis 
for Arithmetic 


1. Introduction. General syntax, the formal part of the general 
theory of signs, has as its basic operation the operation of con- 
catenation, expressed by the connective ‘~~’ and understood as 
follows: where x and y are any expressions, z~ y is the expression 
formed by writing the expression z immediately followed by the 
expression y. E.g., where ‘alpha’ and ‘beta’ are understood as names 
of the respective signs ‘a’ and ‘f’, the syntactical expression 
‘alpha beta’ is a name of the expression ‘af’. 

Tarski! and Hermes have presented axioms for concatenation, 
and definitions of derivative syntactical concepts. Hermes has also 
related concatenation theory to the arithmetic of natural numbers, 
constructing a model of the latter within the former. Conversely, 
Gédel’s proof of the impossibility of a complete consistent sys- 
tematization of arithmetic depended on constructing a model of 
concatenation theory within arithmetic. 

Like Tarski and Hermes, I also have used concatenation as a 
basis for various constructions;? but my constructions differ essen- 
tially from theirs in presupposing no auxiliary logical machinery 
beyond the elementary level: truth functions, quantification, and 
identity. It can be easily shown to follow from my constructions 
that the elementary arithmetic of natural numbers (elementary in 
the sense of using none but the aforementioned logical auxiliaries) 


Reprinted from the Journal of Symbolic Logic, Volume 11, 1946. 
11935, 287ff. 
2 Preceding paper; “On derivability”; Mathematical Logic, Chap. 7. 
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can be embedded in the elementary theory of concatenation. The 
only proviso is that the atomic components from which concatena- 
tions can be formed must be at least two in number, and distin- 
guishable by name; e.g., alpha and beta. 

The present paper, undertaken at Professor Tarski’s suggestion, 
will establish the above more explicitly and in a strengthened form: 
it will be shown not only that the elementary arithmetic of natural 
numbers can be embedded in the elementary theory of concatena- 
tion, but that it can be so embedded as to exhaust the latter, render- 
ing the elementary theory of concatenation and the elementary 
arithmetic of natural numbers identical. 

Actually it will be more convenient to consider, in place of the 
arithmetic of natural numbers, that of positive integers. The con- 
clusions reached can afterward be transferred to the arithmetic of 
natural numbers, as will be seen. 

The paper will presuppose no acquaintance with previous work 
on concatenation. 

Let us now turn to a more exact statement of what is meant by 
the elementary theory of concatenation and the elementary arith- 
metic of positive integers, and a more exact statement of what is to 
be proved about them. 

The theory of concatenation need not be thought of as syntacti- 
cal in subject matter; it may be regarded as having to do with 
finite sequences of any manner of objects. The objects, called atoms, 
are themselves considered sequences, viz., sequences of length one; 
and concatenation is the operation of laying sequences end to end 
to form new sequences. Thus, where x is the sequence consisting 
successively of the atoms a, b, b, a, and c, and where y is the 
sequence consisting successively of b and a, the sequence x7 ~y will 
consist successively of a, b, 6, a, c, b, a. Parentheses will be sup- 
pressed by construing ‘7 y~ 2’ as ‘(x -y)~ 2’; clearly the grouping 
is in fact indifferent. 

The elementary theory of concatenation comprises that body of 
theory which can be expressed in terms of concatenation, identity, 
names of the several atoms, truth functions, and quantification 
with respect to variables ‘zx’, ‘y’, ‘2’, .. . whose values are 
sequences. The theory varies in detail according to the number of 
atoms assumed. Thus, if we suppose the atoms to be just two in 
number and designated by ‘a’ and ‘b’, the formulas of the elemen- 
tary theory of concatenation are the following: 

(i) identities, consisting of ‘=’ flanked by terms each of which is 
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either a simple constant ‘a’, or ‘b’, or a variable ‘x’, ‘y’, ‘2’, . . . , 


or an expression compounded of such simple signs by one or more 
applications of the concatenation sign; 

(ii) all compounds formed from such identities by means of 
truth-functional connectives and quantification with respect to 
variables ‘x’, ‘y’, ete. 

The elementary arithmetic of positive integers comprises that body 
of theory which can be expressed in terms of sum, product, power, 
identity, names of the several positive integers, truth functions, 
and quantification with respect to variables ‘x’, ‘y’, ‘2’, . . . whose 
values are positive integers. Thus the formulas of the elementary 
arithmetic of positive integers are the following: 

(i) identities, consisting of ‘=’ flanked by terms each of which is 
either a simple constant ‘1’, ‘2’, ... , or a variable ‘x’, ‘y’, ‘’, 

. , OF an expression compounded of such simple signs by one or 
more applications of the notational forms ‘x + y’, ‘x X y’, ‘x” of 
sum, product, and power; 

(ii) all compounds formed from such identities by means of the 
truth-functional connectives and quantification with respect to 
variables ‘zx’, ‘y’, etc. 

It will be proved that a model of arithmetic can be constructed, 
in terms of finite sequences, in such a way that all the notations of 
the elementary arithmetic of positive integers become definable in 
the notation of the elementary theory of concatenation of two or 
more atoms. 

Identification of the positive integers with sequences may be 
either unilateral, in the sense that the integers are exhaustively 
identified with an inexhaustive subclass of sequences, or bilateral, 
in the sense that the integers and sequences are so identified as to 
exhaust both the integers and the sequences. The construction of 
arithmetic within the theory of concatenation will be carried 
through first in unilateral fashion (§3), since this is easier. But the 
bilateral construction (§4) is of especial interest in that it makes 
the elementary theory of concatenation and the elementary arith- 
metic of positive integers completely intertranslatable. And it 
shows that concatenation can be interpreted as a purely arith- 
metical operation, in terms of which the other operations of arith- 
metic are definable. 

After proving these things with regard to the arithmetic of posi- 
tive integers, we can infer the same with regard to the arithmetic 
of natural numbers; for it is known (and will be re-established 
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briefly in §5) that the arithmetic of natural numbers is bilaterally 
constructible within that of positive integers. 

The definitions comprised in all these constructions will com- 
monly be, in a certain sense, indirect; that is, free use will be made 
of the descriptive notation ‘(.7)( ... )’, which is not eliminable 
by any direct substitution of expressions built up of primitive 
terms. This notation has not been cited in the above inventory of 
the notations of the theories under consideration, because it is well 
known that the presence of the notational machinery of quantifica- 
tion, truth functions, and identity renders the descriptive notation 
eliminable from any theorem or other formula in which it occurs. 


2. Finite Relations. The chief circumstance which makes possible 
the construction of arithmetic from concatenation theory is that 
single sequences can be made to do the work of all finite relations of 
sequences, by a method which will now be set forth. I shall assume 
here that our theory of concatenation involves at least two atoms, 
a and b; there may be more. 

A finite relation may be thought of first as a finite collection of 
ordered pairs, and represented graphically as a two-column list. 
Consider now any such relation; let it pair vw; with 2, and uw with t%, 
and so on to U, and v,. (The elements 1, v1, te, etc., are them- 
selves any sequences.) Let z be a certain ¢ally, i.e., a certain 
sequence consisting wholly of occurrences of a. (The reason for 
calling such a sequence a tally will become evident in §4.) Further, 
let this tally z be longer than any tally appearing within any of the 
SEQUENCES 21, V1, Ue, etc. Now form n sequences, one corresponding 
to each of the n pairs of the original relation, as follows: 

WwW, = 0° -27~b us bz 1 Oz, 
We = 0°27" un’ “bz bv" bz, 


tn— b> 2b Wo b 2b uu, bo Bo 
and lay all these end to end in turn to form a single sequence: 
w= Wy We". ; Wy, 
Next let us observe certain peculiarities of w. (i) The only occur- 
rences of z21n w are the 8n occurrences of z which are shown explicitly 
an the above expansions of Wi, We, . . . , Wa; there are no additional 
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hidden occurrences of z. This is seen as follows: There can be no 
additional occurrence of z hidden within any of the occurrences of 
U1, V1, Us, etc., because z exceeds any tally therein; nor can there be 
any additional occurrence of the tally z partially overlapping upon 
one of the explicitly shown occurrences of z or of w, v1, UW, ete., 
because each of those explicitly shown occurrences is insulated in 
front and back by the atom b, which cannot occur in a tally. The 
possibilities are thus exhausted. 

(11) The only segments of w which have 2° ~b just before them and 
b~~z gust after, and do not themselves contain any occurrence of 2, are 
the occurrences of 1, V1, U2, elc., which are shown explicitly in the 
above expansions of Wi, We, ... , Wa. This follows from () on in- 
spection of the expansions of w,, We, . . . , Wa. 

(ail) If x and y are sequences netther of which contains any occur- 
rence of z, and if 


(1) 20 Lc Vee Yee Oe 


occurs as part of w, then x and y must be respectively u; and »; for 
some 7. For, by (ii), « must be one of 2%, 2%, we, etc., and y must bea 
later one of them. Yet the occurrences of x and y in question can- 
not be within different ones of the segments w;, We, . . . , Wa of 
w, because if they were they would be separated by at least 
bz bz b instead of just b~-27~b. So x and y must be respec- 
tively u; and v; for some one 7. 

In view of (iii) we have the following result: instead of saying that 
x stands to y in our original finite relation, we can omit mention of that 
relation and speak instead of the sequence w, saying that (1) is part of 
w and that x and y contain no occurrences of z (2 being explained in 
turn as the longest tally in w). 

Let us write ‘w(x, y)’ to mean that (1) is part of w and that x 
and y contain no occurrences of z (where z 1s explained as the long- 
est tally in w); then what we have found is that we can construct a 
sequence w such that (x)(y)[w(a, y) =. 2 stands to y in our original 
finite relation]. 

Preparatory to setting up the formal definition of ‘w(x, y)’ we 
shall need to introduce the notation ‘C’, meaning ‘occurs in’, ‘is 
part of’. It is definable within concatenation theory as follows: 


Dl. t C y =az (32) (3w)(a = y.V. 
2 2=Yy.V.x W=y.V.2 & Dee 
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Thus ‘x C y’ means that the sequence z is a continuous part (or 


all) of the sequence y.® 
Next the notation ‘Tz’, meaning that z is a tally, i.e., contains 


only a’s, is readily defined: 
D2. Tz =a (x)\(x Cz.D.aC2). 


To say that z is the longest tally in w is to say that zis a tally and 
is part of w and that every tally occurring anywhere in w reappears 
as part of z. In symbols this becomes: 


Tz.2Cw.(v)(Tv.»Cw.d.0 Ca), 
which is equivalent to: 
(2) (v)(Tv.0 C w.=.0 C2). 
So now we are ready to define ‘w(z, y)’ as meaning that there isa 


sequence z which satisfies (2), and is not part of x or y, and is such 
further that the sequence expressed in (1) is part of w. 


D3. w(z, y) =ar (Jz)[()(To.0 Cw.=.0C2z).~ (Ca). 
m2 Cy) 2OT TOOT Cw). 


The reasoning of the present section has shown that, for every 
finite relation of sequences, there is a sequence w such that 


(x) (y)[w(x, y) =. 2 bears the relation to y]. 


We have seen how, given a list of the pairs comprising the relation, 
such a sequence w can actually be constructed. 


8. Unilateral Construction. A convenient way of getting a model 
of the realm of positive integers within the realm of sequences 
based on two or more atoms a, b,... is by identifying each 
integer m with the tally which comprises n occurrences of a. Thus 
1, 2, 3, . . . become definable respectively as a, a” “a, a” a” ~a, 

. ; and to say that z is a positive integer is to say simply that 
Ta. 

Where x and y are positive integers, clearly ‘x C y’ amounts to 

‘e = y’; and xy is the sum of x and y. 


’ The definiens in D1 could be simplified to ‘(3z)(3w)(z7 “x7 ~w = y)’ if we 
were to assume a null sequence among the values of our variables. However, 
I have chosen rather to repudiate the null sequence throughout the present 
paper, lest it be thought to be an essential assumption. 
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The product x X y is the sequence which we would get by laying 
down the sequence x over and over, end to end, y times—i.e., as 
many times as a occurs in y. (This explanation makes good sense 
even where z is not an integer; but this is an incidental extension 
that need not interest us.) Let us now turn to the problem of con- 
structing an actual definition to this effect. 

It will first be shown that, for any sequence z, the following con- 
dition implies that z is the product x X y in the desired sense: 


(3) Gioiwa, z). (s)@OlwG t) Ds "a 7 — 
(Ju) Av) (wlu, v).s = a u.t = 27 v))}. 


This is seen as follows. (3) says there is a sequence w such that 
(4) wy, 2), 
GY Ow, i) Dis=a.t=2.V 

(Ju) (Jv) [w(u, v).s = a u.t = x27 vhf. 
By (4) and (5), 
OG) wSa.2c= x. V GeGowat).y =a u.z=2 or 


Now if the sequence y is an atom, and hence no concatenation, the 
second alternative in the alternation (6) must fail; and then the 
first alternative in (6) tells us that y is a and z is x. If on the other 
hand y is longer than an atom, then the second alternative in (6) 
tells us that y and z begin respectively with a and 2, and also that 
the remainders y’ and 2’ of y and z are such that 


(7) wy’, 2’). 
By (7) and (5), 
(8) y  =a.2’=2.V (Au) Qv)[wly, v) .y’ = au. 2 = x4). 


Continuing thus, for as many steps as there are places in the origi- 
nal sequence y, we see finally that y must consist wholly of a’s and 
that z must consist of a concatenation of a like number of 2’s; in 
brief, y is a positive integer and 


(9) | z2=xXy. 


It will now be shown conversely that, for all integers x and y, 
(9) implies (3). Consider the following list of pairs: 
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a x 

aa xu x 

a aa - coe 
y cxXY 


(We know, from the intended sense of ‘x X y’, that the sequence 
reached in the right-hand column opposite y will in fact be x X y.) 
Now the principle stated at the end of §2 assures us that there is a 
sequence w such that, for any sequences s and ¢, ‘w(s, t)’ holds if 
and only if s and ¢ are paired in the above list. From inspection of 
the list, then, it is evident that (5) holds, and also that 


(10) wy, xX y). 


From (10) and (9) we get (4), and from (5) and (4) we get (38). 
The product x X y, therefore, is definable as the one and only 
sequence z fulfilling (3). 


D4. 2X y =a (0:2) €w) {wly, 2). SOlwls, ) Dis =a.t=-2 
.V (du) (av) (wu, v).s = a u.t = 27 v)J}. 


The power x” can be defined quite analogously. The only differ- 
ence is that whereas the product x X y was a continued concatena- 
tion 7 2. . .~ x to y occurrences of x, the power x” is a con- 
tinued product x Xx X...Xx to y occurrences of x. So the 
definition of x” differs from D4 only in using ‘x X v’ instead of 
‘cv’ at the end. 


D5. 2 =a oz) Gw){wly, z) . (s) Olw(s, ) Dis =a.t=2.V 
(Au) (Av) (wlu, v).s = a u.t = 2 X vj}. 


The definitions D4 and D5 are of course of interest only where 
the sequences x and y are positive integers; what they generate in 
other cases need not concern us. It may, however, be remarked in 
passing that, where x is not an integer but y is, x X y continues to 
be the sequence x 2. . .~ x to y occurrences of wz. 


4. Bilateral Construction. We now turn to an alternative method 
of identifying the positive integers with sequences; this time a 
bilateral method, i.e., one which uses up all the sequences. 

Up to now we have been able to leave the number of atoms un- 
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specified, requiring only that there be at least two. For the bilateral 
construction, however, we need to know how many there are. Any 
specified finite number of them would serve, provided there are at 
least two. The construction will first be set forth for the case where 
there are exactly two atoms, a and b. Afterward we shall see how to 
adapt the method to any larger finite number of atoms. 

Bilateral identification of the positive integers with the finite 
sequences of atoms a and b can be accomplished by arranging the 
sequences lexicographically; i.e., in order of length and alphabeti- 
cally within each length: 


1B 3 4 5 6 7 8 
@baaabob ab b@a ea as 
(it) 9 10 i 


a ba a bb 0” @ ign 
In this way not only do the integers receive an interpretation as 
sequences, but also conversely, the finite sequences of a and 6 all 
receive an interpretation as positive integers, and concatenation 
thus itself becomes an arithmetical operation on positive integers. 
In fact, concatenation becomes expressible within elementary arith- 
metic, as follows: 


ey =yt ile X Oz) Fw)(e = 2°. 2 Sy+1<24+2)]. 


This identity will be left unsubstantiated ;4 our avowed business 
lies in the opposite direction, viz., to find ways of defining z + y, 
x X y, and x’, conformable to (11), within the theory of concatena- 
tion. Our business is to show that the new arithmetical operation of 
concatenation is an arithmetical operation in terms of which the 
more familiar operations of sum, product, and power can be defined. 
Though the sequences composed purely of occurrences of a@ no 
longer exhaust the positive integers as they did in §3, they will con- 
tinue to play an important role under the earlier name of ¢allies. 
Definitions D1-3 will be retained. D4 and D5, no longer available 
as definitions of products and powers of positive integers, will still 
be useful in an auxiliary status as defining what may be called the 
tally products and tally powers of tallies; they are accordingly re- 
produced here with accented designations and modified notations: 
D4’, 2 Xry =ar (12) Aw) (wy, z). (s)([w(s, -) Dis =a.t=-2 
V Gu) Gv) wt, vy) .s =a u.t == oie 
‘The reasoning behind it is evident from “On derivability,’’ 115, where 


the present identification of integers with sequences is so rephrased as to ex- 
hibit its relation to the dual system of numeration. 
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D5’. 2 Ary =as (12) Aw) {wly, 2). S)(O[wG, ) Dis=a.t=2 
.V (du) (av) (w(u, v).s = a u.t = x X,v))}. 


A tally consisting of n occurrences of a will be called the tally of n. 
Where z is any positive integer (hence any sequence whatever, 
under the method of the present section), the tally of z will be 
referred to as rz. It is definable in concatenation theory in the light 
of the following considerations. 

Hereunder are listed the successive positive integers from 1 to z, 
paired off with their tallies. 


a a 

b aa 

a a aa a 

ab a a awe 

Db” 1a a ang a” a 

bb a a &@ a a @ 

a a” 4 a2 @ a a @a 
aa” ~b ao ae a a a a@ a 4G 


L TL 


A method of generating the column on the left, comprising the 
integers in the normal order 1, 2, 3, . . . , is described by the fol- 
lowing rules: 

(i) Start with a and b. 

(ii) If « appears in the column, put u~~a one place more than 
twice as far down the column, and put u~~b next after (e., 
u a= 2u+1andu ~b = 2u + 2). 

From (ii), together with the nature of the right-hand column 
(the tallies), it is evident that, where v appears opposite u (so that 
v = ru), what will appear opposite u~ a is v~ v~ a; and what will 
appear opposite u~ “bis v” va’ a. 

In view of these considerations, it can be shown that the defini- 
tion: 

DG. tear hy) Gw){w, y) . (s)\OlwG, ) Dis =a.t=e@.V. 
=b.t=a~a.V (au)(av)(w(u, v): 
Sa €2h= Ves — 27h. t= vo a al} 
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defines the tally of x in the intended sense. The reasoning which 
shows that the condition following ‘(.y)’ in D6 implies that y is 
7x in the intended sense, and vice versa, is parallel to the reasoning 
which showed in §3 that (8) implied (9) and vice versa. 

Now the definitions of x + y, x X y, and z¥ are immediate. We 
can define x + y as the integer whose tally is the concatenate of the 
tallies of « and y; we can define x X y as the integer whose tally is 
the tally product (as defined in D4’) of the tallies of zx and y; and 
we can define x” analogously. 


D7. Bee =a Cele = Fe 60). 
D8. xX y =at (12) (72 = re X, ty). 
D9. xy =as (12)(72 = 7x A, Ty). 


The methods of construction which have just now been explained 
for the case of two atoms a and b will now be adapted to the general 
case of k atoms ai, d2, . . . , ax. Weare to think of k as some speci- 
fied number, finite and greater than 1. 

Bilateral identification of the positive integers with the finite 
sequences of atoms di, dz, . . . , a can still be accomplished by the 
old expedient of arranging the sequences in order of length and 
alphabetically within each length: 


Pe...k k+l k+2... Sh QE... 
Qa, ag... ay” ay Qi” ap Se 4 ai” ax dz” ay ome 
+k k= > ke 
ay” ax Qi” ai" ay ae 
This generalization requires us to generalize the instructions (i) 
and (ii) above to read as follows: 

(i) Start with a, ae, ... , and ay. 

(ii) If w appears in the column, put u~ a, one place more than k 
times as far down the column, then put wu~ az next, and so on to 
UW Ax. 

Thereupon D6 has to be generalized correspondingly. The follow- 
ing auxiliary notation will be used: 


(xc)e =aru7 2, (x)3 =a ee, etc. 
The generalized version of D6 then appears as follows: 
rx =ar (sy) (Aw) {w(2, y) . (s) O[w(s, 2) Dis = a.t = a.V.s = & 
j=) V.s = a,.t= (a),.V (au) (av) (wy, v) : 
=u a.t= (v), Qa V.s =U @.t= (v). (a1)2 .V ee 
iV.S8 =U Gye =e es 
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The definitions D7-9 now carry over to the general case without 
modification. 


6. Natural numbers. We saw in §3 how the elementary arithmetic 
of positive integers can be constructed unilaterally within any 
elementary theory of concatenation which provides at least two 
atoms. We then saw in §4 how that same arithmetic can be con- 
structed bilaterally within any elementary theory of concatenation 
which provides an explicit finite number of atoms, at least two. 

What has thus been shown for the elementary arithmetic of posi- 
tive integers could be shown equally for the elementary arithmetic 
of natural numbers (i.e., positive integers and 0). This follows from 
the fact that the elementary arithmetic of natural numbers can be 
constructed in turn, and indeed in bilateral] fashion, within the ele- 
mentary arithmetic of positive integers. A method of carrying out 
the last-mentioned construction will now be presented. 

The natural numbers, which I shall call 0,, 1,, 22, etc., may be 
arbitrarily identified with the positive integers (which I shall con- 
tinue to call 1, 2, 3, etc.) thus: 


or ln, en 
"2a 3 


I shall call 1, the nominal correspondent of 1, and 2, of 2, and so on, 
alluding thus to the resemblance in names; but nominal corre- 
spondence is not to be confused with the real correspondence 
(cdentity) which, under our construction, holds rather between On 
and 1, between 1, and 2, and so on. In general, the nominal corre- 
spondent of any integer k isk + 1; 2n, for example, is 3. 

Now let us consider how to define the addition ‘+,,’ of the arith- 
metic of natural numbers, in terms of the addition ‘++’ of the arith- 
metic of positive integers. Where x’ and y’ are the nominal corre- 
spondents of x and y, we shall want xz’ +, y’ to turn out as the 
nominal correspondent of x + y. (E.g., we want 5, +n 6, to be 11,.) 
Therefore, since the nominal correspondent of any integer k is 
k + 1, we want to realize the identity: 


(12) (ae Digien (Yee) eaters. 


Similarly, where ‘x Xn y’ represents the product of natural arith- 
metic and ‘x 4, y’ the power, we want to realize the identities: 


(13) @+t)xkKiG@+tl=@xy +1, 
(14) (tel)! 7a 2) eet, 
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where ‘x X y’ and ‘zr’ represent the product and power as of posi- 
tive arithmetic. 

The desiderata (12)-(14) are not quite enough to determine our 
choice of the general definitions of ‘+,’, ‘X,’ and ‘/,’, because 
they describe these operations only in application to numbers of the 
form ‘x + 1’ and ‘y + 1’; and there is one number, namely, 1 
(= 0,), which cannot be rendered in that form. So we must supple- 
ment (12)—(14) with stipulations of what we want when one or both 
operands are 0,: 


(15) On +nz = 4%, 

(16) 24-7 Oh = 2 

(17) 0; nz = 05=— 1, 
(18) z X10, = 0, = 1, 
(19) 0,.Ar.(y+1) =0, = 1, 
(20) z/,0, = 1, = 2. 


Verification that the following definitions meet the requirements 
is left to the reader. 


etay =a(2\(zty =z+1), 
zXay =a (zz t+yt2= (eX y) + QI, 


tgAay=aaly=1l.z=2.V.yXx1.2=2=1.V Gwe 
w+il.zXw=vw'+w)). 


Vi 


«t Set-theoretic Foundations 


for Logic 


1. Introduction. In his set theory, 1908, Zermelo uses the vari- 
ables ‘x’, ‘y’, etc., for the representation of ‘things’ generally. 
Among these things he includes sets, or, as I shall say henceforth, 
classes. He adopts the connective ‘e’ of membership as his sole 
special primitive; thus the elementary formulae of his system are 
describable simply as expressions of the form ‘z ¢ y’, with any thing- 
variables ‘x’, ‘y’, ‘2’, etc., supplanting ‘x’ and ‘y’. The postulates of 
his system are so fashioned as to avoid the logical paradoxes with- 
out use of the theory of types. One of the postulates, the so- 
called Aussonderungsaxiom, may be stated in familiar logical nota- 
tion as: 


(Az)(y)((y ex) = (yez). E(y))) 


where ‘€ (y)’ is understood as any statement about y which is defi- 
mite in a certain sense which Zermelo introduces informally for the 
purpose. Skolem in 1930 pointed out that it is adequate here to con- 
strue “definite” statements as embracing just the elementary for- 
mulae and all formulae thence constructible by the truth functions 
and by quantification with respect to thing-variables. A second of 
Zermelo’s postulates is the principle of extensionality; this asserts 
that mutually inclusive classes are identical, i.e., are members of 
just the same classes. There are further postulates which provide 
for the existence of the null class, the class of all subclasses of any 


Reprinted from the Journal of Symbolic Logic, Volume 1, 1936. 
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given class, the class of all members of members of any given class, 
the unit class of any given thing, and the class whose sole members 
are any two given things. Finally the multiplicative axiom (Aus- 
wahlprinzip) and the axiom of infinity are adopted. 

In the expression of Zermelo’s postulates and the proof of his 
theorems the notions and technique of elementary logic are pre- 
supposed as an implicit substructure. The only logical notions 
which are here required are the truth functions and quantification 
with respect to thing-variables, and the only logical technique re- 
quired is that governing the truth functions and quantification. To 
frame the Zermelo system as a deductive system in its own right, 
then, we must extend its foundations to include this part of logic, 
in application to the elementary formulae. This can be done by 
adopting quantification and the truth functions (or an adequate 
selection of these) as further primitives, and providing the system 
with rules of inference of such kind as to provide all the deductive 
technique of truth functions and quantification. 

It is usual, in the full presentation of a deductive system, to begin 
metamathematically by specifying what expressions are to be 
formulae of the system; then to use this notion of formula in a meta- 
mathematical account of the rules of inference of the system. In the 
present case, since the primitives are just membership, quantifica- 
tion, and the truth functions, clearly the formulae of the system 
will embrace just the ‘‘definite”’ statements as construed by Skolem. 
As to the rules of inference, one will be the familiar logical rule 
which allows the inference of a formula f as a theorem whenever the 
result of substituting f for ‘Q’ and some theorem for ‘P’ in ‘P D Q’ 
is a theorem; and there will be other such rules, of familiar kind, 
providing in the aggregate for the whole technique of truth func- 
tions and quantification. Moreover, since a “definite” statement is 
now simply any formula of the system, we can express the Aus- 
sonderungsaxiom itself as another ordinary rule of inference, which 
we may call the Aussonderungsregel: Each result of putting a formula 
for ‘P’ in 


‘(Ax)(y)(y €x) = (ez). P))’ 


is a theorem. (Strictly, we should add explicitly that ‘x’ must not be 
a free variable of the substituted formula.) The Aussonderungs- 
axiom thus ceases to depend on the notion of ‘‘definiteness” and 
acquires the same formal status as the logical rules of inference. 
Finally the rest of Zermelo’s postulates can be translated into the 


SET-THEORETIC FOUNDATIONS 85 


primitive notation and retained as formal postulates. Zermelo’s sys- 
tem is thus transformed into a formal deductive system containing 
rules of inference and postulates. 

The purpose of this paper is the presentation of a system T which 
resembles the above system but is more economical. TI has the same 
rules of inference as the above system, including the Aussonderungs- 
regel; the latter, however, is modified to the extent of supplanting 
‘y ez’ by the inclusion ‘y C 2’, which we may suppose defined in the 
familiar fashion. By way of formal postulates, ! contains only the 
extensionality principle; the described modification of the Aus- 
sonderungsregel enables us to abandon all the other postulates and 
still derive the whole of standard mathematical logic, as measured, 
eg., by P.M. 

This does not mean that T, so constituted, is adequate to the 
derivation of all Zermelo’s postulates; it means that T is adequate 
to the derivation of as many consequences of Zermelo’s postulates 
as are needed for standard logic in the adopted sense. In particular 
the multiplicative axiom and the axiom of infinity are as requisite 
to I’ as they are to Zermelo’s system; I follow P.M.,? however, in 
suppressing these as postulates and requiring their statement rather 
as explicit hypotheses wherever necessary. 

There is a further respect in which I is to be contrasted with 
Zermelo’s system and others: it contains no null class. The null 
class presupposes a distinction between classes and individuals 
(non-classes) which is inexpressible in terms of membership, since 
the null class is like an individual in lacking members. Repudiation 
of the null class enables us to explain the variables of I as repre- 
senting objects generally, classes and otherwise, and then to define 
classes simply as objects y such that (az) (a € y). 

Though T lacks two chief features of the standard logic, viz., 
types and the null class, we shall see that within I we can construct 
definitionally a derivative theory A in which both the null class and 
the familiar stratification into types are ostensibly restored and all 
theorems of the standard logic are forthcoming. 


2. Formal Construction of T. The P-formulae, or formulae of I, 
are describable recursively thus: expressions of the form ‘(z € y)’, 
with any I-variables ‘2’, ‘y’, ‘2’, ete., supplanting ‘x’ and ‘y’, are 
T-formulae; so also are the results of putting any I-formulae for 


1 ‘Whitehead and Russell, 2d ed. 
2 Vol. 2, 183; Vol. 1, 481. 
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‘P’ and ‘Q’ and any I-variable for ‘x’ in ‘~P’, ‘(P D Q)’, and 
‘(z)P’. A T-variable x is said to be bound at a given point (say at the 
kth sign) of a T-formula p if that point lies with a T-formula q 
which forms part of » and begins with a parenthesized occurrence of 
r (as universal quantifier). An occurrence of r is free in p if x is not 
bound at that occurrence in p. r is said to be free in ), or to be a free 
I-variable of p, if x has a free occurrence in p. 

I'-formulae are abbreviated through the following familiar 
definitions. 


Di. (P.Q) =a ~(P D ~Q). 

De: (P = Q) =ar((P D Q) . (QD P)). 
D3. (Az) P =a: ~(z) ~P. 

D4. (x Cy) =ar @)(@ex) D Gey). 


Definitions are viewed as conventions extraneous to the formal sys- 
tem. For formal purposes we may think of definitional abbrevia- 
tions as expanded always into full primitive expression. 

Among various ways of compressing the familiar technique of the 
propositional calculus and theory of quantifiers into a few rules for 
the generation of theorems in I, one way is to declare that a I- 
formula f is a I-theorem under any of the following conditions: 

I'l. fis formed by putting I-formulae for the capitals in one of 
the schemata: 


(a) (CP By Die Dk) OD WS H))), 
(b) (PD @P Di), 
(c) UE 2) Dok 


Y2. The result of putting f for ‘Q’ and some I'-theorem for ‘P’ in 
‘(P > Q)’ is a T-theorem. 

13. There are I-formulae p and q and a I-theorem r such that f 
and r are formed respectively from ‘(P D (x)Q)’ and ‘(P D Q)’ by 
replacing ‘P’ by p, ‘Q’ byq, and ‘x’ by a ’-variable which isnot free in p. 

14. There are I-formulae p and q and a I'-theorem r such that f 
and r are formed respectively from ‘(P > Q)’ and ‘(P D (z)Q)’ by 
replacing ‘P’ by p, ‘Q’ by q, and ‘z’ by a I-variable. 

T'5. f is formed from a I-theorem by replacing all free occur- 
rences of a I-variable r therein by a I'-variable which is not bound 
at any of those occurrences of r. 
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T'1-2 are an adaptation of one of Lukasiewicz’s systematizations 
of the propositional calculus. They provide as I'-theorems all I- 
formulae which are instances of valid forms of the propositional 
calculus.4 2-5 answer to Tarski’s definition of logical consequence.® 
Given the machinery of the propositional calculus as provided by 
11-2, P3-5 provide all the familiar technique of quantification. 

As intimated in §1, the general theory of deduction as embodied 
in T'1-5 is supplemented in I by one special postulate and one 
special rule as follows: 


re. 3 ‘“((@Cy) D (YC 2) D (ex) D (ew) D (yew)))))’ 


is [a transcription, under D4, of] a I'-theorem. 

I'7. If ‘P’ is supplanted in ‘(ax)(y)((yex) = ((y Cz). P))’ by a 
T-formula in which ‘z’ is not free, the result is [a transcription, 
under D1-3, of] a T'-theorem. 

The clause ‘(z ¢ x)’ in the extensionality principle I6 is inserted 
to restrict the principle to classes and thus leave room for the 
existence of more than one individual. 

As framed above, I involves denial, material implication, uni- 
versal quantification, and membership as primitive. Actually 
denial is superfluous; we could define it thus: 


~P =a (PD @)Y)@ey)). 
(a)—-(c) could then be supplanted in Tl by: 


i=) > (i D8) D T)) D 
CU DG DS) St) De OU) So oe BD, 


which answers to Wajsberg’s postulate for the calculus of material 
implication.’ The resulting system is demonstrably equivalent to 
I. If on the other hand it is thought desirable to provide for the 
properties of the truth functions independently of those of member- 
ship, as is done in I’, there remains an alternative reduction of 
primitives, namely the adoption of Sheffer’s stroke function instead 
of implication and denial.’ [2 would then be refashioned to match 


2 See Lukasiewicz and Tarski, Satz 6. 

4See my “Truth by convention,”’ note 18 [17 in one edition]. 

® Tarski, 1933, Def.9. Note that “Fr(@)’’ here is a misprint for “Fr(n)’’. 
6See Lukasiewicz and Tarski, Satz 30. 

mee PM ., Vol. 1, xiii, xvi. 
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Nicod’s rule of inference,’ and (a)-(c) would be supplanted in 
Il by: 


(P| 1 ®)) | (S| SI S)) | CS] Q) | (P| S) 1 (P18), 


which answers to Lukasiewicz’s reduced form? of Nicod’s postulate. 
Throughout this paper, however, it will be convenient to retain I 
in its original redundant form. 


8. The System A. A system A due to Tarski’ will be identified 
with “standard logic’ and shown to be derivable from I. A, unlike 
I, countenances nullity of classes and imports the simple theory 
of types. A distinctive kind of variables is appropriated to each 
type. All the A-variables are I-variables with positive integral 
indices attached: those with the index ‘1’ take individuals as 
values, those with ‘2’ take classes of individuals, those with ‘3’ take 
classes of such classes, and so on. The I-variable to which the 
index is attached will be called the base of the A-variable. 

The elementary A-formulae are formed by replacing ‘z’ and ‘y’ 
in ‘(z ey)’ by A-variables whose indices are consecutive and ascend- 
ing. These formulae are elaborated into further ones, as in I’, by 
the truth functions and quantification. For simplification of later 
matters it will be demanded that no A-formula contain A-variables 
with like bases and unlike indices; this restriction does no violence 
to Tarski’s scheme, since it touches only the arbitrary choice of 
letters. A A-formula is describable in general, then, in terms of the 
I-formulae, as any expression f such that every [-variable in f 
bears a numeral (positive and integral, as will always be understood 
hereafter) as index, all like T-variables in f bear like indices, ‘e’ 
occurs only between I-variables bearing consecutive ascending in- 
dices, and the result of deleting all indices from f is a T'-formula. 
A-formulae are then abbreviated as in D1-2. 

Despite its simplicity, the language of A is adequate “zum 
Ausdrucke jedes in Principia Mathematica formulierbaren Ge- 
dankens.””!! Relations, e.g., can be introduced in terms of classes by 
the method of Wiener or Kuratowski. 

A A-formula f is declared to be a A-theorem under any of the 
following conditions: 

Al-4. (Same as 'l—4, but with ‘A’ put for ‘T’ throughout.) 

8 Tbid., xvi. 

9See Lukasiewicz, ‘‘Uwagi o aksyomacie Nicod’a’’. 

10 1933. My formulation departs from Tarski in inessential respects, largely 


notational. 
Tarski, 1933, 97. 
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Ad. [A transcription, under D1-2, of] f is formed by putting 
A-variables for the letters in ‘((z)((zex) = (zey)) D ((rew) D 
(yew)))’. 

A6. [A transcription of] f is formed by putting A-variables r and 
y for ‘zx’ and ‘y’ in ‘~(xz)~(y)((y ex) = P)’ and supplanting ‘P’ 
by a A-formula in which ¢ is not free. 

For purposes of Al-6 the words “bound” and ‘“‘free” are of 
course explained just as in §2, but with ‘A’ supplanting ‘T’. 

In Tarski’s formulation there is a further A-rule, paralleling T5. 
He needs this because in his formulation of A5-6 he specifies the 
bases of the A-variables involved. It has been convenient here, on 
the other hand, to phrase A5-6 without reference to specific sets 
of variables. Since there is consequently no such reference any- 
where in Al-6, the substitution of one free variable for another in 
a theorem can be accomplished by repeating the whole derivation 
of that theorem under a different initial choice of variables. A rule 
of substitution parallel to 5 therefore becomes superfluous. Note 
that T'5 could similarly be eliminated from IT, by so rephrasing 
I'6-7 as to avoid reference to specific variables. Conversely, also, 
the system A admits of reduction in ways parallel to those sketched 
fom I in §2. 


4. Construction of A Within T. A differs from T not only in de- 
pending upon the theory of types but also in involving some extra 
elements, viz., the null class in each type. The problem of deriving 
A from T thus resembles the problem of deriving the system of 
rational numbers from that of integers, or the system of integers 
from that of natural numbers. In the latter derivation the natural 
numbers are not directly supplemented by the negative integers; 
rather the whole series of integers is developed anew, in such fashion 
that the non-negative integers do not coincide with the natural 
numbers with which they are intuitively associated. A similar 
course is adopted in deriving A from T; rather than directly supple- 
menting I’ somehow with null classes, the procedure is to present a 
general reconstruction of the elements of A in terms of I. 

For each element of A let us define a correspondent recursively 
thus: the correspondent of an individual is its unit class, and the 
correspondent of a class x is the class whose members comprise all 
individuals and in addition the correspondents of the members (if 
any) of «. Now I incorporate A into I by reinterpreting A to 
the extent of supposing all its elements replaced by their corre- 
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spondents. Thus a A-element of first type, represented in A by a 
variable with ‘1’ as index, and formerly construed as an individual, 
is reconstrued as a class (a unit class, as it happens) of individuals. 
A A-element of n + Ist type, formerly construed as a class of A-ele- 
ments of nth type, is reconstrued in two respects: its members are 
reconstrued in the manner appropriate to the nth type, and it is 
then assigned all individuals as additional members. In particular, 
thus, the null class of each type is reconstrued simply as the class of 
all individuals. All A-elements are thus reconstrued as classes; 
and this in the strict sense of I, viz., classes with members. 
Now ‘1!z’, read ‘zx is of first T-type’, is defined in I thus: 


D5. I!x =ar (Gy)(y ex). y)(Yex) D (w) ~ (wey))). 


Thus 1!z if and only if « has members but none of them have 
members in turn; i.e., the first T-type comprises all and only the 
classes of individuals. Hence the ‘‘individuals”’ or first-type ele- 
ments of A, reconstrued as they are as certain classes of individuals, 
become classes of first T-type. 

Next, given the nth T-type, the n + Ist is defined thus: 


D6. n+ Ile =a (y)(((yex) = w)(wey)) D nly). 


Where n + 1!z, D6 obviously provides that if y ex and y has mem- 
bers then n!y; further D6 can be shown to provide that y « x if y has 
no members.” In effect, then, the n + Ist I'-type is defined as com- 
prising just those classes x which have all individuals and in addi- 
tion none but classes of nth T-type as members. From the described 
reinterpretation of A it thus follows that if all A-elements of nth 
type are reconstrued as classes of nth I'-type then all A-elements of 
n + Ist type are reconstrued as classes of n + Ist T-type. Since all 
A-individuals are reconstrued as of first T-type, it follows by induc- 
tion that the A-elements of every type n are reconstrued as of nth 
I'-type. In particular the null class of any type n, reconstrued as it 
is as the class of all individuals, turns out to belong at once to all 
I'-types.'8 

Formally the incorporation of A into [ will consist in regarding 
each A-variable, having say the nth numeral as index, as restricted 
in range to the nth I-type. Thus zndered quantifiers are introduced 
into I through this definition: 


D7. . GP =" Aaa. 


12 See the next-to-the-last formula in the proof of Lemma 3, below. 
13 This is the basis of the proof of Lemma 5, below. 
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This convention is added: 

C1. An index which is attached to a I-variable within an indexed 
quantifier may be attached also at will to any recurrence of that 
variable within the scope of that quantifier. 

Strictly, D5-6 should be regarded not as a definition of the gen- 
eral case ‘n!z’ but as a convention for the adoption of an infinite 
series of specific abbreviations ‘1!z’, ‘2!z’, ete. D7 then introduces 
a further series of abbreviations. 

A A-formula will be called closed if it contains no free A-variables. 
Now let p be a A-formula with just k(20) free A-variables. Any 
result of putting p for ‘P’ in ‘(a,™) . . . (a,“)P’, and putting the 
free A-variables of p in any order for ‘x,’ to ‘a,’, will be called a 
e.g. (closed generalization) of p. Now D1-7 and Cl explain every 
closed A-formula as a conventional transcription of a T-formula. In 
particular, all closed A-theorems are thus translated into T. The 
A-theorems in general may then be translated into I by interpreting 
each as a closed theorem in the familiar fashion, viz., by supposing 
each supplanted by one of its c.g.’s.4 (If a A-theorem is closed to 
begin with, it is of course its own c.g.) To show that the system A 
proceeds from I, then, we need only show that every c.g. of a 
A-theorem is [a transcription, under D1-7 and Cl, of] a I-theorem. 


5. Proof That Every c.g. of a A-theorem Is a T-theorem. The proof 
will only be sketched here; in rigorous form it would involve elabo- 
rate metamathematical developments and run to greater length. 
All the familiar technique of the propositional calculus and theory 
of quantifiers will be taken for granted, since there is no essential 
difficulty in showing that the technique accrues to Fr through P'1-5. 
Such clauses as the bracketed ones in '6-7 and A5-6 and at the end 
of §4 will be suppressed entirely, as in the title of $5; in effect, thus, 
definitional transcriptions of I'-formulae or of A-formulae will be 
treated as I-formulae or A-formulae in their own right. 


Lema 1. If T-formulae are put for the capitals in the following 
schemata, the results are T'-theorems: 


(a) (P DQ) D (CR = ~P) D Q) D (BD Q))) 
(b) (QD ~P) D (Rk = ~P) DQ) D (PD R))) 
eye. Q).>) (i DS)) D (P Dif) D ((P «Q) DS) 


“See P.Af., Vol. 1, xiii. Indeed, Tarski has revised A in just this fashion 
(1935, 365-366, 291-299). 
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(d) (WP = Q) DR). (S = Q) DR)) D 
(# D (P = 8)) D (PDS). BD P))) 


(e) (PDQ) (EDP) DD QDS = @. ae 
(R.T)))) D(S = ~P) DR). (DS = T))))))) 


Proof. All these are valid forms of the propositional calculus, as 
ean be checked by the truth-table method or otherwise. 


Lemma 2. The following are T-theorems: 
(a) (w) ~ wey) DY C2) 
(b) @x)(y)(yex) = Y C2)) 
(c) (y)(w) ~ wey) 
(d) (z)(y)((yex) = w) ~ Wey)) 
(e) (ax)1 fa 
(f) (ax)(z)(@ C 2) 


Proof. The case of (a) is clear from D4. (b)—(d) are derivable as 
follows. By I7, 


GniniGer) =U ez) 7 a2). (1) 
(Ax) (y)((yex) = ((y C2). (w) ~ Wwey))), (2) 
and (dy) (w)((w ey) = (w C 2). ~@ C 2))). (3) 


(b) proceeds from (1), (c) from (3), and (d) from (2) and (a). (e) and 
(f), finally, are derivable as follows. By (d), 


(z)((y)((w) ~ wey) D (yex)). (y)(Yex) D (w) ~ Wey))). 
Hence, by (c), (ax)(Gy)(y ez). (y)((yex) D ww) ~ Wey))). 
Again, by (ad), (z)(z)((y)(y ez) D (w) ~ (We y)) D 

(y)((y ez) D (yex))). 
Hence, by Dé, (ax) (z)(1lz D (y)((yez) D (ye =))). (5) 
In view of D5, D4, and D7, (4) and (5) are (e) and (f). 
Lemma 3. A I-theorem results from putting any numeral for ‘n’ in: 
(a) (nly D Aw) ey)). 
Proof. Let the nth and n + Ist numerals be represented as ‘n’ 


and ‘n + 1’, and suppose (a) to be a '-theorem. Then we can derive 
another I'-theorem as follows. By Lemma 1(b) and D3, 
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(nly D @w)(wey)) D (((yex) = Aw)wey)) D nly) D 
((w) ~ Wey) D (yex)))). 
Hence, by (a), ((((yex) = G@w)Wey)) D nly) D 
((w) ~ wey) D Yex))). 
Hence, by D6, (n + 1!z D (y)((w) ~ (Wey) D (yex))). 
Hence, by Lemma 2(c), (n+ 1!lzD Gy)(y €2)). 
In this way, if the result of putting the nth numeral for ‘n’ in (a) 
is a I'-theorem, the result of putting the n + 1st numeral for ‘n’ in 
(a) is derivable as a I'-theorem. But the result of putting ‘1’ for ‘n’ 


in (a) is a I-theorem, as is seen immediately from D5. Lemma 3 
then follows by induction. 


Lemna 4. A I-theorem results from putting any numeral for ‘n’ in 


(a) (az) (y")(y C 2). 


Proof. Let the nth and n + lst numerals be represented as ‘n’ 
and ‘n + 1’ and suppose (a) to be a '-theorem. Then we can derive 
another I'-theorem as follows. By D6-7, 


((y")(y Cz) D 
(x)(n + lz DY) (yer) = Gw)Wey)) D &Y C2)))). 


Hence, by (a), 

(3z)(z)(n + Ile D (y) (yer) = Gw)(wey)) D YC -z))). (1) 

By Lemma I(a) and D3, 
((w) ~ Wey) D YY Cz) D 

(yer) = Gw)Wwey)) D YY Cz)) D (yez) D Y Ca2)))). 
Hence, by Lemma 2(a), 

((((yex) = (Aw)(wey)) D Y Cz)) D (Yex) D Y Ca2))). 
Hence, by (1), Gz)(z)(n + 1!z D (Yy)(yex) Dy C2))). 
Hence, by Lemma 2(b), (32) (x) (n + 1!2 D (y)((yex) D (ye £))). 
Le., by D7 and D4, Gace) (e Ga 

In this way, if the result of putting the nth numeral for ‘n’ in (a) 
is a I-theorem, the result of putting the n + 1st numeral for ‘n’ 


in (a) is derivable as a T-theorem. From Lemma 2(f), then, Lemma 
4 follows by induction. 


Lemma 5. A I-theorem results from putting any numeral un for ‘n’ 
an ‘(dx)n 2’. 
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Proof. Lemma 2(e) covers the case where n is ‘1’. If nis not ‘1’, 
then, representing n as ‘m + 1’, we can derive the I'-theorem thus. 
By T1(b), 


(yer) = Ww) ~ Wey)) D (~(Yer) = W) ~ ey)) D mly)). 


Hence, by Lemma 2(d), 
(az) (y)(~(Yy ex) = (w) ~ (we y)) D my). 


Le., (ax)(y)(C(y ex) = ~Ww) ~ Wey)) D mly). 
Le., by D3 and D6, (3x)m + I!e. 


DEFINITION. If p is a A-formula, that I-formula which is ob- 
tained by dropping the indices from the free A-variables of p will be 
called |p|. 


Lemna 6. If t: tox (0 S k) are the bases of the free A-variables of a 
A-formula p, 1 to nz are the respective indices attached, and q is the 
result of putting |p| for ‘Q’, 11 to vz for ‘x1’ to ‘x;’, and n to ny for ‘(1)’ 
to ‘(k)’ in: 


((Cilay. . 2 Get) DO) 
then q 1s a T-theorem if and only if every c.g. of p is a T-theorem. 


Proof. Every c.g. v of p has the form ‘(y,") . . . (y,!*!)P’, with 
‘P’ replaced by p, and ‘y;’ to ‘y;’ replaced by the r; in some permuta- 
tion, and ‘[1]’ to ‘[k]’ replaced by the n; in the corresponding permu- 
tation. By D7 and Cl, tr will be a I-theorem if and only if 8isa 
T'-theorem, where 8 is formed from ‘(y;)({I]!y D . . . (yz) ([k] !yz D 
Q) ... ) by putting |p| for ‘Q’ and replacing ‘y:’ to ‘y;’ and ‘{1]’ to 
‘{k)? as before. But, by the propositional calculus and theory of 
quantifiers, 8 will be a I'-theorem if and only if the result t of the 
same replacements in ‘(({I]!y. .. . [k]!y.) D Q)’ is a I-theorem. 
But t is carried into q and vice versa by permutations which are 
allowed by the propositional calculus. Hence t, and therefore r, will 
be a T-theorem if and only if q is a I'-theorem. But r was any c.g. 
of p. Hence q is a I-theorem if and only if every c.g. of p is a 
I-theorem, q.e.d. 


Lemma 7. If pis a A-formula and |p| ts a T-theorem, every c.g. of 
as a T-theorem. 


Proof. If |p| is a T-theorem, the q of Lemma 6 follows immedi- 
ately as a I-theorem. By Lemma 6, then, every c.g. of p is a 
I'-theorem, q.e.d. 
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Lemna 8. Jf p, g, t, and 8 are A-formulae, r ts a A-variable, and r 
and 8 are formed respectively from ‘(P D Q)’ and ‘(P D (2)Q)’ by 
putting p for ‘P’, q for ‘Q’, and x for ‘x’, then (a) zf all c.g.’s of » and of 
t are T'-theorems, so are all c.g.’s of q; (ii) uf ris not free in p, and all 
c.g.’s of rv are T-theorems, then so are all c.g.’s of 8; (ili) if all c.g.’s of 
8 are T-theorems, then so are all c.g.’s of t. 


Representations. Let those A-variables which are free in p but not 
in q be represented as ‘x,’ to ‘x%—!’ (1 S Ah), let those which are 
free in both p and q be represented as ‘x,’ to ‘2,’ (h S k + 1), 
and let those which are free in q but not in p be represented as 
“eGt? to ‘tm (k Sm). Let p and q be represented accordingly 
as ‘o(a™, . . . a) and WW(a,™, 2. . tm)’. Let x be repre- 
sented as ‘z,™’, where n is 0, m, 1, or h according as ¢ is free in 
neither p nor q (Case 1), in q but not in p (Case 2), in p but not 
in q (Case 3), or in both p and q (Case 4). Consequently 


iy. . . te)’, Wis .. Bag), 
Ge: . ate) De iy ome)! 
answer respectively to |p|, |q|, and |r|, and |8] is represented as 


“(O(a - « « Te) D (a™)WGr, . . « Tm)’ 
(n < h; Cases 1 and 3) 


or "Oe... ti) > Ge UG, .. . en eee) 
(Case 2) 


or ‘(o(21, 55 8 Lx) wy) (r.)Yar™, Th+ly 2 + + Tei))” 
(Case 4). 
Proof of (i). A T-theorem is derivable as follows. By the hy- 
pothesis of (i), together with Lemma 6, 
((()iop. . . . Cele) Da, . . . mn) (1) 
and 
ea... . (alt) D Gi, . . . te) DHG,, .. - Ze). ©) 
By Lemma l(c), 
ian. ... (mtn) > (6G, . . . DIG, . . « Ga) D 
tee. ... (ike) Da, ... eo) D (CA)... . 
Ge) ie) DD Gries . . Gom)))): 
Hence, by (2) and (1), 
Cn cs cgilGtt) ties) Way. . . em). 
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Le., 


(Gan(hila D> .«. (Sera) (h — lien DIG) a. . 2 
(m) lam) DW, . . . m))) ee 


Hence, by Lemma 5, (((hA)!lm....-. (mjlz,) DW, .. ae 


By Lemma 6, then, every c.g. of q is a I'-theorem, q.e.d. 

Proof of Gi). By the hypothesis of (ii), together with Lemma 6, 
(2) above is a I-theorem. By hypothesis, also, Cases 3 and 4 are 
excluded. In Cases 1 and 2, I'-theorems are derivable as follows. 

Case 1. By (2), 


W@biicn. .. . (m)lzn,) D (6%, . . ap D 
(tn) ((n) ln D Wa, . . . Ga)))). (3) 


eeeby DIP Gh. .. Clee DIGG ans 
(ta Way» - » Tm))). (4) 


By Lemma 6, then, every c.g. of 8 is a I-theorem, q.e.d. 
Case 2. By (2), 


Cahir... .. (am — 1)lz,4) DIGG azn D 
(tm) (Cm) tm D W(ta, . . - Xm)))). (5) 


Le., by D7 and Cl, 


Mitr. =. . (a — 1) ena) DEG. ..222) D 
(im) (rn, - . . Sma1, Sma) 


By Lemma 6, then, every c.g. of 8 is a I'-theorem, q.e.d. 


Proof of (iii). Cases 1 and 3. By the hypothesis of (i1), together 
with Lemma 6, (4) is a I-theorem. By D7, then, so is (3). Hence 
SO is: 


((ijlmn.... (mle) D (6, . . Be Ge Gila 
WArn, . . . Cale 


and consequently, by Lemma 5, so is (2). 

Case 2. By hypothesis and Lemma 6, (6) 1s a T-theorem. By D7 
and Cl, then, (5) is a T-theorem. Hence so is (2). 

Case 4. (2) is derivable as follows. By hypothesis and Lemma 6, 


((d)ic. . . . (lan) D (61, . Se 


(an) on™, trea, . . . Cae 
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Le., by D7 and Cl, 


ie... Gai.) OG, ..- 2) DT GO 
Wim, : . . Bq)))). 


Hemee (((1)!lm... . (m)lem) D (@, « . . me) D 
(0) ie Way . . . @ad)))- 


Deletion of the repeated premise reduces this to (2). 
(2) is thus a I'-theorem in all four cases. By Lemma 6, then, all 
c.g.’s of r are I-theorems, q.e.d. 


Lemma 9. If the A-formula { 1s formed by putting A-formulae for 
the capitals in (a), (b), or (c) of T'l, then every c.g. of f is a T'-theorem. 


Proof. If { results from putting p, q, and r for the capitals in (a), 
then |f| results from putting the '-formulae |p|, |q|, and |r| for the 
capitals in (a). Then, by Yl, |f| is a T-theorem. Consequently, by 
Lemma 7, every c.g. of f is a T'-theorem. The cases of (b) and (c) 
are analogous. 


Lemma 10. Jf a A-formula f is formed by putting A-variables x, y, 
3, and W for ‘x’, ‘y’, ‘2’, and ‘w’ in ‘((z)((@ex) = (zey)) D (wew)D 
(y ew)))’, every c.g. of f is a T'-theorem. 


Proof. Let us represent r, ), 3, and ww as ‘a*+!’, ‘y"+?’, ‘2"’, and 
‘wt’, (The indices must be consecutive as indicated, since f is a 
A-formula.) We can derive a I’-theorem as follows. By Lemma 1(d), 


((((zex) = Gw)(wez)) D niz). (ey) = Gw)(w ez)) D nkz)) 
> ((nkz D (ex) = Zey))) D 
((zex) D Zey)) - (ey) D @ez))))). 


Hence, by D6-7, C1, and D4, 
(n+ 1iz.n+ lly) D 
((2")(@* ex) = ey) D (@ Cy). Ca)))). 
Hence, by Lemma 3, 
(n+ 1la.n+1!ly)D 
((e") (Ce ex) = Gey) D (eCy)-¥ Cz). Ge)(ez)))). 
Hence, by T6, 
(m+ lla.n+ lly) D 
((2")((2" ex) = (4 €y)) D (Wew) D (yew)))). 
Hence 
(n+ 2lw.n+ 1lla.n+ 1ly) D 
((2")((2" ex) = (2"ey)) D (ew) D (yew)))). 
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Then, since ‘((2”)((2"«z) = (2"€y)) D ((rew) D (yew)))’ is |fl, 
it follows from Lemma 6 that every c.g. of f is a I-theorem, q.e.d. 

Lemma 11. If a A-formula f is formed by putting A-variables x and 
y for ‘x’ and ‘y’ in ‘~(x)~(y)((y ex) = P)’ and supplanting ‘P’ by 
a A-formula pin which t is not free, then every c.g. of f is a T-theorem. 

Proof. Let ) and r be represented as ‘y,” and ‘r+, and let the 
free A-variables of p be represented as ‘y;% to ‘y,% (OSA S 
kA =h-+1). If y is free in p, h = k; otherwise h = k — 1. Finally, 
let us represent p as ‘d(y:™, .. . yx)’. Now a I-theorem is 
derivable as follows. Let 


‘(yee x) = (yx C 2). (Gw)we yr) D (Ck) lyn oC, - - ~ yad))))’ 
be abbreviated as ‘&’. By Lemma I(e) and D3, 


(((w) ~ (we yx) D (yx C 2)) D (CCA) yz D (Aw) (w € yx)) D 
(((k) yx D (yz C 2)) D (@D (Cyn, € x) = (aw) (we yx)) D 
(k) ly.) . (Kk) lyn D (Yrer) = OQ, . - - UAE 


Hence, by Lemmas 2(a) and 3, 
(A) ly. D (Ye C 2) D (@D (CY ex) = Aw) (we yx)) D 
(kK) lyx) « (CK) yx D (yee x) = (ys, . - - yn)))))). 
Hence, by D6-7, 
(yx) (yx C Zz) D 
((yx) BD ((k) + Ile. YH) (yee x) = oH, - . - Yya))))). 


But, by '7, (Az) (y,) &. Hence 
(yx) (yr C 2) D 
(Az)((k) + Iit. Ge) (iy, ex) = oY, . . . yn)))).- 


Hence, by Lemma 4, 

(Ax)((k) + Iie. (yx) (yn ex) = (yr, - - - Ya))). 
Le., ~(z)(k) + le D ~Yy) (ye € x) = 6, - - - Yr))). 
I.e., by D7 and Cl, 

MOM) ~ (YX) (Ye € 2H) = oy, « - - Ya))- 


The I-theorem thus derived is |f| if h = k — 1. If h = k, it be- 
comes |f| when ‘y,’ is rewritten according to C1 as ‘y,’. In either 
case, then, it follows from Lemma 7 that every c.g. of f is a I- 
theorem, q.e.d. 


THEOREM. Every c.g. of a A-theorem is a T-theorem. 
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Proof. A A-theorem f will be said to be of rank 0 if it satisfies Al, 
A5, or A6. A A-theorem f will be said to be of rank n + 1 if there are 
A-theorems p and rt, one of rank n and the other of rank n or less, 
such that r is the result of putting p for ‘P’ and f for ‘Q’ in 
‘(P D Q)’; or if there are A-formulae p and q and a A-theorem tr of 
rank n such that f and r are formed respectively from ‘(P > (x)Q)’ 
and ‘(P > Q)’ by replacing ‘P’ by p, ‘Q’ by a, and ‘x’ by a A- 
variable which is not free in ); or if there are A-formulae p and q and 
a A-theorem 8 of rank n such that f and 8 are formed respectively 
from ‘(P > Q)’ and ‘(P > (2)Q)’ by replacing ‘P’ by », ‘Q’ by a, 
and ‘x’ by a A-variable. 

If we assume that every c.g. of any A-theorem of rank 7 or less is 
a I'-theorem, and take f as a A-theorem of rank n + 1, then Lemma 
8 shows that every c.g. of fis a I-theorem. Then, since by Lemmas 
9-11 every c.g. of any A-theorem of rank 0 is a [-theorem, it fol- 
lows by induction that every c.g. of any A-theorem of rank m is a 
I-theorem, for every natural number m. But it is clear from the 
above definition of rank, together with Al-6, that every A-theorem 
has at least one natural number as its rank. Therefore every c.g. of 
any A-theorem is a I’-theorem, q.e.d. 


VI 


«t Logic Based on Inclusion 
and Abstraction 


All notions of mathematical logic, as measured, e.g., by Principia 
Mathematica, can be constructed from a basis which embraces 
variables and just two further primitives: the familiar notions of 
inclusion and abstraction. Inclusion will be expressed by the usual 
form of notation ‘(z C y)’, which may be read ‘z is included in y.’ 
Abstraction will be expressed by the form of notation ‘v3 .. .’ as 
with Peano,! the blank being filled by any formula (statement or 
statement form); the whole may be read ‘the class of all objects x 
sucwatbaitie. . «7. 

A system of logic based on these primitives will be presented in 
this paper. The system involves the familiar theory of class types, 
which, for metamathematical convenience, will be recorded in the 
system by use of distinctive styles of variables; thus the variables 


‘zy’, ‘y’, and ‘z’, also with subscripts when further variety is needed, 


will take individuals as values; the variables ‘x”’, ‘y’”’, and ‘z’’, also 


with subscripts, will take classes of individuals as values; the vari- 
ables ‘x’”’, ‘y’”, and ‘z’”, also with subscripts, will take classes of 
classes of individuals as values; and so on. The metamathematical 


notion of term, covering all variables and also all expressions of 


Reprinted from the Journal of Symbolic Logic, Volume 2, 1937. An improve- 
ment which was described in a last-minute footnote of the original paper has 
here been worked into the text. 

1 In the notation of Principia Mathematica the sign ‘3’ is dropped, and instead 
a circumflex accent is placed over the initial variable; but this presents typo- 
graphical difficulties. 
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abstraction, is now describable recursively thus: a variable is a 
term, and the number of accents which it bears is called its rank; 
and if terms of equal rank are put for ‘¢’ and ‘y’ and a variable of 
rank n is put for ‘a’ in ‘a3(¢ C 7)’, the result is a term of rank 
n-+1.A formula, finally, is any result of putting terms of equal 
panletor ‘Cand ‘y’ in ‘(¢ Cm)’. 

Customarily, in logic, inclusion is thought of as applying only to 
classes, and as meaning that every member of the one class is a 
member of the other. But the above definition of formula counte- 
nances the notation of inclusion in application also to variables of 
rank 0, and hence calls for an interpretation of inclusion as applied 
to individuals. Various interpretations of this limiting case are 
compatible with the system which is to follow; we might interpret 
‘(x C y)’ as meaning that the individual z is spatially or temporally 
included in the individual y, or as meaning simply that x and y are 
the same individual. The latter course has the advantage of avoid- 
ing any assumption as to the nature of our individuals.? So far as 
the system is concerned, indeed, there is no need to posit individ- 
uals at all; the lowest type might be empty, so that the lowest 
occupied type would have the null class of individuals as sole occu- 
pant, and the variables of rank 0 might then be appropriated to 
this lowest occupied type. Again, the hierarchy of types might run 
downward infinitely, and the variables of rank 0 might be regarded 
as corresponding merely to the lowest logical type concerned in the 
context at hand. On either of these latter views, the values of vari- 
ables of rank 0 would enter the inclusion relation simply as classes. 
These two views and the more usual one, which countenances indi- 
viduals, are all suited equally to the system which is to follow. 

To facilitate metamathematical discourse about the system, the 
letters ‘a’, ‘6’, ‘y’, and ‘6’ will be used to denote any variables of the 
system; ‘f’, ‘yn’, and ‘@’ will denote any terms; and ‘7’, ‘v’, ‘¢’, ‘y’, 
‘x’, and ‘w’ will denote any formulae. When expressions denoted by 
these Greek letters are thought of as combined into a complex ex- 
pression by ‘C’, ‘3’, or other signs, the whole complex expression 
will be referred to metamathematically by combining the Greek 
letters themselves in corresponding, fashion and enclosing the 
whole in corners, ‘“’, ‘”. Thus '(¢ C 7)! is the expression which is 
formed by combining the terms ¢ and 7, whatever they may be, by 
an inclusion sign and parentheses in the indicated fashion. If ¢ and 
n are the respective terms ‘x’’” and ‘y’’3(y” C 2’’)’, then '(¢ C 7)! 

2 However, see Leonard and Goodman, 63f. 
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is the formula ‘(2’” C y’’3(y” C 2’’))’. In general, an expression 
beginning and ending in corners is to denote the expression which 
we obtain, from the expression between the corners, by replacing 
all Greek letters by the expressions which those Greek letters are 
intended to denote. The corners may thus be viewed as ‘‘quasi- 
quotes”; but they must not be confused with ordinary quotation 
marks. For example, '(¢ C 7)'is a certain unknown expression; it 
is not wholly specified until the terms ¢ and 7 are specified, and it 
becomes a formula if these are specified as of equal rank; whereas 
‘(¢ C n)’ is the specific non-formula depicted between the quotes, 
and containing the sixth and seventh letters of the Greek alphabet. 

a will be said to cover a given occurrence of 8 in ¢ (or in ¢) if that 
occurrence lies within a context of the form ‘a3’ within ¢ (or ¢). 
An occurrence of win ¢ (or in £) is bound in ¢ (or in ¢) if it is covered 
by a in ¢ (or in ¢); otherwise free. A variable is said to be free in 
@ (or in ¢) if it has a free occurrence in ¢ (or in ¢). Replacement of 
every free occurrence of a in ¢ by £ is called substitution of ¢ for a 
in ¢, provided that no variable which is free in ¢ covers a free occur- 
rence of a in ¢. 

We turn now to nine definitions, building up from the primitive 
notions of inclusion and abstraction. The definitions are to be re- 
garded as conventions of notational abbreviation extraneous to the 
formal system. The first introduces the material conditional: 


D1. "(¢D yw)" for “(a3s¢ C B3y)', where a is alphabetically the 
first variable of rank 0 which 1s not free in %, and 8 1s similarly related 


to y. 


The conformity of this definition to the usual sense of the sign 
‘>’ rests on the fact that, where a is not free in ¢ nor 6 in y, the 
respective classes denoted by 'a3¢'and "83" are universal or null 
according as ¢ and y are true or false.* So far as the meaning is con- 
cerned, a and 6 might be chosen as any variables of equal rank not 
free respectively in ¢ or ¥; to make the definitional abbreviation 
unambiguous notationally as well as interpretationally, however, 
the choice of a and £@ is specified uniquely in D1. For purposes of 
D1 we may fix the alphabetical order of variables of rank 0 arbi- 
tranily as ‘a’, ‘y’, ‘2’, ‘xy, ‘y:’, ‘21’, ‘te’, ... ; and fer purpésesse: 
the subsequent definition D8 we may order the variables of each 
higher rank correspondingly. 

The next definition introduces universal quantification: 

*See my System of Logistic, 37-38, 45. 
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D2. ‘(a)¢' for "(a3(¢ D $) C a3¢)’. 

The next introduces denial, utilizing the falsity of ‘(z’)(y’) 
(eC y’)’: 

D3. '~¢" for (oD (ey )@’ Cy’). 

The next three definitions are familiar: 

D4. "¢.y)' for '~(6 D ~y)". 

D5. '(¢ = ¥) "for (DY). HOD ¢))*. 

D6. '(ja)¢' for '~(a)~o". 


The remaining definitions introduce identity, unit classes, and 
membership: 


ee C= 7) ‘fer UG Gy) . GC 6)”. 


D8. ‘tt for "as(a = ¢)', where ats alphabetically the first variable 
of the rank of § which is not free in §&. 

oe? (Cen) for "oC 4)". 

The definitions D1, D4, and D5 are of course adopted for all 
formulae ¢ and y¥; D3 for every formula ¢; D2 and D6 for every 
formula ¢ and every variable a; D8 for every term £; and D7 and 
D9 for all terms ¢ and 7. Similar generality is intended for the rules 
and “‘metatheorems” which will follow. However, D7 is useful only 
where 7 is of the same rank as [, and D9 only where 7 is of next 
higher rank than §, since otherwise the expressions abbreviated are 
not formulae. 

Definition of all further notions of mathematical logic can now 
proceed along familiar lines. Relations, for example, can be intro- 
duced in terms of classes by Wiener’s method or Kuratowski’s.* 

Rules R1-5 will next be presented which generate the formal 
theorems of the system. In stating these rules the sign ‘+’ will be 
used to express the metamathematical property of being a theorem 
of the system—a property which the rules R1-5 themselves re- 
cursively define. Statement of the rules will be facilitated further 
by sometimes attaching exponents to first occurrences of Greek 
letters to indicate the relative ranks of the terms denoted. Thus Rl, 
‘e'((a® C 8") D (68 Cy”) D (aC 7¥)))"’ tells us that, where a, 
6, and y are any variables of equal rank (say n), "((a C B) D 
((6 C vy) D (a C y)))'is a theorem (better, an abbreviation under 
D1 of a theorem) of the system. Finally, for perspicuity most 

‘See next paper. 
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parentheses will be suppressed hereafter and dots will be used for 
punctuation in familiar fashion. 


Rl. +'e" C B*.D:8 Cy .D.aCy 

R2. +'a®3a(aey"t! > 6) C a3(ae B*t! .D. wey) .D.B Cy 
R3. If K¢@ then FTE"! C a3! 

R4. If +e and +'¢D wW' then Fy. 


R5.5 Lf k ¢, and ¥ is formed from ¢ by replacing an occurrence of 
"t €a3x' by the result (if any) of substituting ¢ for ain x, then FY. 


By way of establishing the adequacy of these rules, it will be 
proved that they provide all the theorems of Tarski’s system.® 
Tarski’s primitives are the four notions defined in D1-3 and D9 
above, together with variables like those of the present system. 
His rules are as follows (supposing D4-6 carried over): 


Cie ese Dy xeD x: 

T2516 ¢.D ¢'-. 

T3. F'@D. ~ OD Y" 

Ue F(a" =. ye 8") ier (eo. Dope 

T5. If a"t ts not free in 6, +" (daa)(B")(Bea.= ¢)1. 

T6. If H'}D (a)p' then FA Dy" 

T7. If H"@ D W", and a is not free in ¢, then "$ D (aw. 
T8. If  ¢, and y results from substituting 8" for a” in ¢, then Fy. 
T9. If +d and H'dD ' then Ky. 


A series of metatheorems M1-29, metamathematical theorems 
about the theorems provided by R1-5, will now be proved. This 
will establish the adequacy of the present system to Tarski’s; for 
T1-8 will all reappear among M1-29, and T9 is already at hand as 
R4. 


M1 (cf. T8). If kd, and y results from substituting ¢* for a” in ¢, 
then ky. 


6 R3 and R5 answer to the rules of subsumption and concretion used in my 
System of Logistic (43, 48). The rule of concretion was used by Whitehead and 
Russell (*20.3) and by Frege, 1893, 52. 

6 See Tarski, 1933, 97-103, especially Definitions 7 and 9. 


INCLUSION AND ABSTRACTION 105 


Proof. By 1st hp and R3 (and D9), KE '¢ ¢ a3"; .., by 2d hp and 
Rs, ty. 

m2 EGC 7” .Dt7 C O™ De CO. 

Proof from R1 by repeated use of M1. 


M3. If a” is free in neither (*t! nor n**, 


ETa3s(aen.D 6) Ca3s(aef.D.aen).D.§ Cn" 

Proof similar from R2. 

Ma(=T1). -'e@Dy.Di¥ D x.Di¢ D x". 

Proof: M2 (and D1). 

M5. Fg Di ¢D¥.D 

Proof. Let a and B be asin D1. By M2 (and D1, D9), 

tE'aea3d.:¢Y.D. ae Bay" 
M5 then follows, by R35. 

M6. E'? D:¢D ¢._ $1 Proof: M5. 

Bee ae? Oh DX D:D ae 

M8. F'¢ D.W~Dx:D:ixDw.DidD.yDw 

ao? Oey. iy ) xX .D:% Dio.D.% Dia! 

Proofs of M7-9, from M4-5 (and R4), are analogous to the 
proofs of 2-7, 2-87, and 2-94 from 2-1 and 2-6 in my System of 
Logistic (72-74). 

RHO. "dD ¢'. 

Proof. By M7, tL '[M6] > 

HS D's-D. ¢ De. (1) 
Let a be asin D1. By M3 (and D1), 
K'a3s(aea3d.. >) Ca3(aea3d._. a€a3¢).D.¢_) 6"; 
.., by R5 (and Di), t+ '[(1)] D [M10]*. 


The above proof is abbreviated in ways which will be usual here- 
after. ‘[M6]’ is short for the whole expression ‘¢ ):¢ _) ¢._ ¢’ 
which stands between corners in the metatheorem M6. ‘{10]’ and 
‘[((1)])’ are analogous. The end of the proof involves tacit use of R4 
in obvious fashion; similarly the ends of the proofs of M14 and 
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M18, below, involve two and three applications of R4. When R4 
is used in a non-terminal step of a proof, it is indicated by renewal 
of the signs ‘+’ without closure of the preceding line; thus the 
step leading to (1) in the above proof amounts to the following: 


By M7, [MG] Smee. . b Dih”, 
.., by M6 and R4,+'d?D¢.D.¢D 1 (1) 


Similarly the step leading to (3) in the proof of M18 amounts to the 
following: 


By M8, -"[M6] 5::. [M6] >: D:.¢D¢.D:¢D¢ Oey 
.., by M6 and R4, F'[M6] D::¢D:.¢D¢.D:¢D¢.D $'; 
.., by M6 and R4,+'¢D:.¢6D¢.D:¢D¢.D o'. (3) 

Ma. EH iptG ee 0:0" Ce.D.f Ce. 

Proof: By M7, /"[M2] D> [M11]7. 

M12. E"(a%)¢.D. (+! C a3d". 

Proof. By M10 and R38, E'E C a3(¢ D ¢)1. (1) 
By M2 (and D2), -[(1)] D [M12]. 

M13. E'(a")¢.D. f*€a3¢'. Proof: M12 (and D9). 


M14. If a* ts free in neither §°*) nor n*t, 


E'(a)\(aef.D.aen).D.£ C7". 


Proof. By M12, t"(a)(@ef.D.aen) .D.a3(aen.D ¢) C 
a3s(ael...aen)' (1) 


By M4, tT(1)] D. [M3] D [M14]2. 
M15. If /¢ then E'(a)¢'. Proof: R3 (and D2). 
Mi16. If F'¢ D v' then E'(a)d.D. (ay 


Proof. By hp and M15, E'(a)(¢ Dy)". (1) 
By M14, L'(a)(aea3d.D.a€a3y) .D. a3¢ C asp; 
es, EM] D 

t+ 'a3s¢ C as" (2) 
By M12, L"(a)d .D. a3(¥ D vy) C a3¢" (3) 
By M11 (and D2), -K-'[(2)] D 


t'a3(y Dv) C a3¢._. (a)y". (4) 
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By M4, + '[(3)] D. [(4)] D [ths]". 


Here ‘hp’ and ‘ths’ refer respectively to the hypothesis 
‘-'s > w” and the thesis ‘K '(a)¢.D. (a)y” of the metatheorem 
which is being proved. Thus ‘[ths]’ here is short for ‘(a)¢ .D. (a)y’. 


M17 (=T6). If F°¢ D (a)W" then F'@ DW". 

Proof. By M13, EK '(a)y.D. ae asp’; 
.., by Ro, ET(a)y Dy". (1) 
By M4, t [hp] >. [(1)] D [ths] ’. 

M18 (=1T7). If +'é D Ww), and a is not free in ¢, then +"? D 
(a)y". 

Proof. Let 8 be alphabetically the first variable of rank 0 which 
is not free in ¢. By M3 (and D2), 


LK 'B3(6B e asd .D ¢) C 
B3(8 € a3(¢ - $) .D. Be a3¢) ._. (a)d!; 


.., by 2d hp and R5, F'83(¢ D 4) C B3( D $.D 4) .D. (ale; 


i.e. (by D1), PS Del Bee. De Do@e. W 
By ist hp and M16, E'(a)¢.D. (a)y". (2) 
By M8, L'[M6] >. [M6] D 

-'é D2 Dig D1 DS. 6". (3) 
By M9, EH ((3)) D:(@)] S. (2)! > Tes)". 


M19 (=T2). F'~¢D¢.D ¢. 
Proof. Let a be as in D1. By M8 (and D2-3), 
L'a3s~(aea3d) C a3(aea3($ D ¢) .D. a € @3¢) .D. (a)$'; 
.., by R5 (and D1), E'~¢ .D:¢D¢.D¢.:D. (a)¢); 
~. ey N17, E'N¢.D:¢D¢.D¢.ID ob (1) 


By Mil (and D1), -'[M6] D> 
OD. 4D: 9D IDs" 3?) 
By M4, 2) > tay > me)". 
M20 (=T3). k'@¢D.~¢ Dy. 
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Proof. By M5 (and D3), H'¢ D. ~¢D (2')(y)(e’ Cy’)™ @) 


By M13, Ge GH) Diezer syle Gaae 
., by Rd (and D9), re) Q)@ Cy) .D. yD@ey’)’. @) 
By M13, EY )(ey’) .D. zapey’3(zey’) 
“sige RS, E'(y’)\(zey’) .D.zez3y": 
Be our ean K'(_y)(zey’) DW. (3) 
By M4, mee) DENG. 
Ei) y)@' Dy) dD (4) 

By M8, -"[(1)] D. [4] D [M20]". 

M21. F'¢D ~y¥.D.¥D ~@'. 

wi22, "6 = ¢'. 


M23. F'¢=vV.D.¢@D v1 
M24. F'¢=y.D.¥ D ¢1. 
M25. F'¢?Dv.D:.¢dDx-Di¢dD.v.x! 


626... -'s D. v Dat (Di De xX DwibD:ist aw. iy 
> @ mee 


Proofs of M21-26 may be omitted, for it is known that all formu- 
lae which are tautological, or valid by virtue solely of their struc- 
ture in terms of the truth functions, are provided as theorems by 
R4, M4, M19, and M20.’ 


M27 (=T5). If a*+' 7s not freein o, K'(Aa)(8") (Bea .= ¢)' 


Proof. By M13, 
(a) ~ (8)(Bea.= $) .D. B3peas ~ (B)(Bea. 


.., by hp and R35, 
E'(a) ~ (6)(Bea.= ¢) .D. ~ (6)(B € B3¢ .= ¢) |; 


“., by R65, LE '(a) ~ (8)(Bea.= ¢) .D. ~(6)(¢ = 4)". (1) 
By M22 and M15, +'(8)(¢ = ¢)7. (2) 
By M21 (and D6), ETd)) >. fey) > BeaL 


¢)'; 


7 See §2 of preceding paper. 
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R28. EPS a DD FeO Die 
Proof. Let a” be free in neither ¢ nor 7. By M2, 
Sa ee eee eae (1) 
By M11, - oC mare 4 fs yac |. (2) 


By M26 (and D7), EF '[(1)] D. [@)] D 
KS =7.Dia=7n7.D.a= 0); 


.., by M18, Kg =9.D. (ala =9.D.a =f)". (3) 
By M14, t+" (a) (@eiy. Dna ef) ._DaayiG a"; 
eas Gnd'Ds), H"@ia=7500sO.DeCce. 
iy M2(and.D9), Ll Ce. D: F480 .D. wel! (5) 
By M9, E{(3)] D: [(4)] D. [()] D [M28]. 


M29 (=T4). E'(7")(y € att! .=. y € 6") .D. 
(Om) (qe 6 2). B @8)r. 


Proof. By M23, E'yea.=.yeBiDiyea.D.yveB); 
., by M16, E'(vy)(yea.=.ye8)... (y)(vea.L. 7 €B)* (1) 


By M14, iQ ea. Day ae. )- a © # (2) 
By M4, (SaaS 

GG area) mc Gir. (3) 
Similarly, using M24 instead of M23, 

t'iy)\(yea.=. ye) .D.8 Ca’ (4) 
By M25 (and D7), F'[(8)] D. [(4)] D 

ET)y ea .=. ye) .D.a = 8" (5) 
By M28, Fane — 6.) ae). Gieon, 
”., by M18, "ea = 6 .D. O)(@ed.>. B e's)". (6) 


By M4, F[(S)] D. [(6)] D [M29]. 


VILti 


at On Ordered Pairs 


and Relations 


Wiener, in 1914, reduced the theory of relations to that of classes by 
construing relations as classes of ordered pairs and defining the 
ordered pair in turn on the basis of class theory alone. The defini- 
tion, as improved by Kuratowski, identifies the ordered pair 2;y 
with we U tia U cy). 

In terms of Russell’s theory of types, x;y in the above sense is two 
types higher than z and y. Even when we abandon Russell’s theory 
of fixed types of objects in favor of a theory of stratified formulae,! 
there is still significance in saying that ‘x,y’ is of type 2 relative to 
‘vy’ and ‘y’—meaning that a test of the stratification of any context 
involves assigning a higher number by 2 to ‘z;y’ than to ‘z’ and 
ty’. 

In 1941, I proposed a definition of the ordered pair which re- 
duced the type difference to 1, construing z;y as the class of unit 
subclasses of « and complements of units subclasses of y.2 Goodman 
then presented a more complicated definition in which my reduc- 
tion of type was coupled with another advantage, that of im- 


This paper has been formed by telescoping two notes, ‘“‘On ordered pairs’ and 
“On relations as coextensive with classes,’ which appeared in the Journal of 
Symbolic Logic, Volumes 10 and 11, 1945 and 1946. 

1 This is done in my “‘New foundations” (see next paper) and ‘“‘On the theory 
of types,’’ in Hailperin’s ‘‘A set of axioms for logic,’’ and, with ill consequences, 
in my Mathematical Logic (1940; see paper after next). 

2 See Goodman’s footnote 5. Note that the concluding twelve words of that 
footnote are in error. 
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mediate extensibility beyond ordered pairs to the general case of a 
sequence of any finite length. 

The purpose of the present note is to show that the type differ- 
ence can be reduced to 0; 1.e., that ‘z;y’ can be so defined that it 
turns out to be of the same type as ‘xz’ and ‘y’. Thereupon sequences 
of any finite length x;y;z;w; . . . can be obtained by iteration as 
z;(y;(z;(w; . . .))) without increase of type. Relations of all de- 
grees thus come to be of the same type, relatively to the objects 
which they relate, as classes of those objects. But the definition will 
be of narrowly theoretical interest only, being too inelegant to 
aspire to adoption as a standard version of the ordered pair. 

For general applicability, this new definition will depend on the 
assumption that all entities are classes—an assumption upon 
which my previous definition and Goodman’s likewise depended. 
This assumption is fulfilled if individuals are identified with their 
unit classes; i.e., if membership in individuals is explained as 
identity.* This is compatible with Russell’s theory of types, modi- 
fied to the extent of accrediting all individuals to all types.‘ 

The rationale of the new definition is as follows. I define a certain 
notion ‘6z’ in such a way as to provide that 

(i) ‘@z’ is like ‘z’ in type, 

(ii) there is no object z for which 0 « @z, 

(iii) given 6z we can determine z uniquely. 

This done, I take x;y as the class whose members comprise the 
correspondents 6z of the several members z of x, together with the 
“tagged”’ correspondents 6z LU .0 of the several members z of y. Le., 


te wazeen.w = @2.V. ey. & = 02 VU 0). 


This will meet the formal requirement which any satisfactory defi- 
nition of the ordered pair must meet, namely that x and y each be 
uniquely determined by 2;y. For, given x;y, in view of (ii) we can 
spot the correspondents of members of x as those members w of 
x;y such that ~ 0 w; and given these we can in turn get 2, in view 
of (iii). Similarly we can get y from the other members of x;y, after 
dropping 0 from each. In brief, 


x = £(02 € x;y), y = 2(62 U 0 e x;y). 


So it remains only to define ‘@z’. We can realize (i)—(iii) by con- 


$ This is done in ‘“New foundations,” 71, and Mathematical Logic, 122, 135. 
4See Goodman, 151. 
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struing 6z as formed from z by adding 1 to all natural numbers 
belonging to z, leaving other members of z unchanged. I.e., 


dz = i(av)(vez:~veNn.u=v.V.veNn.u = Sv) 


where Nn is the class of natural numbers and Sv is the successor of v. 
Clearly this version of ‘62’ fulfills @). Also it fulfills (11), since 0 e Nn 
and (v)(0 ¥ Sv). Also it fulfills (aii); for, 


z= v(~wveNn.vedz.V.veNn. Sve). 


The arithmetical notations used above must of course be defined 
without presupposing the theory of relations, and thereby the 
notion of ordered pair, if we are to avoid circularity. But this can 
be done in familiar fashion: 


0 =A, 
Sv 
Nn 


é(ay)(yer. x) wer), 
4a) Oey. OCG ey. Deezey) .D. 2 ey). 


The special advantage claimed for this definition of the ordered 
pair, over the simpler alternatives due to Wiener and others, was 
that it causes x;y to be of the same logical type as x and y. The 
definition has also the following further advantage: 7t makes every- 
thing an ordered pair. Any object a whatever becomes identical 
with the ordered pair 


2(0z €a); 2(0z2 U Wea). 


Every class accordingly comes to be a class of ordered pairs, i.e., 
a relation. Indeed, everything comes to be at once an ordered pair, 
a class, and a relation. The terms ‘ordered pair’, ‘class’, and ‘rela- 
tion’ fuse and become valueless as designations of categories, be- 
cause all-embracing; but they retain importance still as relative 
terms—thus an object a is in general an ordered pair of certain 
objects b and c, a class of different objects d,e,f, ... , anda rela- 
tion of different objects still. 

There results a notable simplification in the relationship between 
the calculus of relations and the calculus of classes. So long as there 
were objects which were not ordered pairs, the universal relation 
(class of all ordered pairs) had to be distinguished from the uni- 
versal class V. Likewise the relational complement of a relation x 
(viz. the class of all ordered pairs foreign to x) had to be distin- 
guished from the classial complement ¢. These distinctions were 
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made in Principia Mathematica (Vol. 1, 213-215), and in my 
Mathematical Logic (205-209), by applying a dot to the relational 
analogue. But when everything becomes an ordered pair, these 
distinctions drop out. What had been a relational counterpart of 
the calculus of classes becomes identical with the calculus of 
classes; for relations and classes come to be one and the same thing. 

Simplification results also in connection with such further rela- 
tional concepts as converse, relative product, and ancestral. So 
long as there were objects which were not ordered pairs, the princi- 
ples 


x=k «Cy=fC¥ vwc=2xl|ll, «rC*z 


(for example) held only under the hypothesis that x is a relation. 
Or, what comes to the same thing, they held only when modified 
by restricting the class x, at its first mention in each of the four 
principles, to its relational part.’ But when everything comes to be 
an ordered pair, these four principles and all others like them be- 
come valid without restriction. The notion of relational part drops 
out altogether, for the relational part of a class comes to be the 
class itself. 


§ J.e., the class of all ordered pairs belonging to zx. It is expressed in Mathe- 
matical Logic by applying a dot. 


TX 


at On w-Inconsistency and a 


So-called Axiom of Infinity 


The purpose of the present paper! is to dispel such mystery as may 
be traceable to use of the ill-chosen term ‘w-inconsistency’ and to a 
certain application of the term ‘axiom of infinity’. The application 
of ‘axiom of infinity’ which I have in mind is made by Rosser in his 
penetrating studies of the system of my ‘‘New foundations.” Let 
me begin with a synopsis of that system, which I shall call NF. 

NF assumes as primitive Just the membership connective ‘e’, the 
truth functions, and quantification over a single style of variables 
‘x’, ‘y’, etc., without type distinctions. The axioms, superimposed 
on standard quantification theory, comprise the axiom of exten- 
sionality ‘(xz)(cey.=.cvez).yew._.zew’ and the axioms of 
abstraction, these latter being all sentences of the form ‘(dy)(z) 
(c ey .= Fx)’ in which the sentence supplanting ‘Fx’ is stratified 
(i.e., adaptable to type theory by some assignment of types to 
variables). 

Identity is defined in NF in a familiar way, and then singular 
description is defined contextually by essentially Russell’s method. 
Class abstraction, next, is defined by the schema: 


Fx =a (yy) (a) (a ey .= Fa), 
and wz, V, and A are then of course defined as #(y = x), (x = 2), 
This paper, first drafted in May 1951, went, in January 1952, to the Journal of 


Symbolic Logic, from which it is here reprinted (Volume 18, 1953). 
1] gratefully acknowledge Rosser’s helpful criticism of earlier drafts. 
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and £(z ~ x). 0 and successor (Sx) are defined in Frege’s way, as 
cA and 9(az)(zey.d(wey.w # 2) €2). 

Existence of each of the classes A, cA, wA, etc., is provided by the 
axioms of abstraction, and the distinctness of each from each is also 
readily proved; so we are assured that any model of NF must be 
infinite. Moreover some of the classes provided in NF must indi- 
vidually be infinite; notably V, which contains everything. 

But how is the existence of infinite classes, or say the infinity of 
V, to be expressed within NF? A natural suggestion is ‘(7)(z « Nn 
._). V €x)’, where Nn is the class of all natural numbers. Nn itself 
can be defined, it would seem, by Frege’s method as 


(1) £(y)Oey. (z\(zey.D.Szey).).zey), 


and its existence is assured by the axioms of abstraction. A shorter 
formula which is easily proved equivalent to ‘(x)(z e Nn ._. V é2)’ 
is ‘A €Nn’,? so Rosser has called ‘A é¢ Nn’ the axiom of infinity for 
NF. It seems odd that no way of proving it in NF has been found,’ 
despite the apparent fact, argued above, of its intuitive truth for 
NF. Is it possible that a finitude axiom ‘A e Nn’ is compatible with 
NF despite the obvious inadequacy of a finite model? 

This seeming paradox may, before we resolve it, be pressed fur- 
ther. It is easy in NF to prove each of the theorems ‘A ¥ 0’, 
‘A #1’, ‘A ¥ 2’, etc., individually. Yet, no way having been found 
of proving ‘A é Nn’, it is conceivable that ‘Ae Nn’ can be added 
without formal contradiction. The resulting system exhibits what, 
following Gédel (1931), has come to be called w-inconsistency; that 
is to say, each of 0, 1,2, . . . can be proved to fulfill a certain con- 
dition (in this case non-nullity), and yet it can be proved (in 
‘Ae Nn’) that some natural number violates the condition. Such 
w-Inconsistency does not imply real inconsistency (i.e., occurrence 
of theorems of the form ‘pj’), but it seems equally intolerable. 

However, the whole situation comes to appear much less queer 
if we critically re-examine the Fregean version (1) of Nn. Its 
rationale is that 0, 1, 2, . . . comprise all and only the common 
members of all classes which contain 0 and are closed with respect 
to successor. Now obviously 0, 1,2, . . . areindeed common mem- 
bers of all classes which contain 0 and are closed with respect to 
successor. The question is whether 0, 1, 2, ... exhaust those 

2 Cf. T10 in Rosser, 1939. 


* Note added 1965: Specker subsequently found a proof and published it 
in 1953. 
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common members. The answer to this question depends on what 
classes there are, in NF (or in a given model of NF), for us to take 
common members of. There is no assurance, in the axioms of ab- 
straction of NF, that every intuitively conceivable class of entities 
of NF must exist as an entity of NF. 

Case 1: A class containing just 0, 1, 2, . . . does exist. Then, 
since this is one of the classes which contain 0 and are closed with 
respect to successor, we are assured that 0, 1, 2, . . . do exhaust 
the common members of such classes. Accordingly (1) expresses the 
genuine Nn. 

Case 2: No class containing just 0, 1, 2, . . . exists; all classes 
big enough to contain all of 0,1, 2, . . . are too big to contain just 
these. Then (1), which expresses a class for NF in any case (by the 
axioms of abstraction), does not express the genuine Nn; it merely 
expresses a certain class to which all of 0, 1, 2, . . . belong among 
other things. One of those other things may well be A, despite infin- 
ity of V. Infinity of V precludes nullity of any of 0, 1, 2, 
but it need not prevent A from belonging to the class eniTeaeil in 
(1) in case that class includes other things besides 0, 1, 2, 

It is therefore misleading to define ‘Nn’ as (1) and then speak of 
‘A €Nn’ as the axiom of infinity. This course is appropriate only in 
Case 1. If ‘Ae Nn’ is in fact demonstrable in NF, or better if 
‘A e K’ is demonstrable where ‘K’ is defined as (1), then we can 
make sense of the situation only by acquiescing in Case 2 and con- 
cluding that neither K nor any class, for NF, contains all and only 
0,1,2,...;Aisnoneof0,1,2,... ,and‘A eK’ means simply 
that every class which contains all of 0, 1, 2, . . . contains A as 
additional member.* 

There is evidence that NF must in fact be allocated to Case 2, 
unless we are prepared to settle for some heterodox results in the 
theory of ordinal numbers or elsewhere. Rosser and Wang have 
shown in effect that any model of NF which interprets the ‘=’ of 
NF as genuine identity, and interprets the ‘0’, ‘1’, etc., of NF as the 
natural numbers, and interprets K (defined by (1)) as containing 
only the natural numbers, must fail to preserve well-ordering in the 
modeling of a theory of ordinal numbers constructed in NF. They 
sum up this situation in Henkin’s terminology by saying that NF 


4 Incidentally it can be shown that if every such class contains A as addi- 
tional member then every such class contains also other additional members; 
viz., eV, also the class of complements of unit classes, also the class of comple- 
ments of two-member classes, and so on. 
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admits of no standard model. They show also that the other famil- 
iar logics are in a somewhat related predicament—viz., that none 
of them can be proved, even by means of certain stronger logics, to 
admit of a standard model. But in the case of NF the impossibility 
of a standard model is proved outright. 

We saw earlier that NF, with ‘A e Nn’ proved or added, would 
be w-inconsistent, and this eventuality seemed as intolerable as real 
inconsistency; but the matter took on a very different complexion 
when we took ‘K’ rather than ‘Nn’ as our abbreviation of (1). Now 
a similar reflection dispels the queerness of w-inconsistency in gen- 
eral, also apart from NF. Where w-inconsistency co-exists with 
simple consistency, there is nothing outlandish afoot; there is 
merely a predicate, misinterpreted as “is a natural number’’, which 
has proved to be true of some other things besides the natural num- 
bers. 

Moving to a closer examination of the general notion of w-incon- 
sistency, let us consider an at least partially interpreted formal 
deductive system © which contains, among other notations, the 
familiar notations of the truth functions and quantification subject 
to the usual interpretations. Suppose that © is consistent (in the 
ordinary sense of the word) and that its theorems proceed from 
axioms by the usual logic of truth functions and quantification. 
The universe appropriated to the variables of quantification is to 
be interpreted as including, among other things, all of 0, 1, 2, etc., 
ad infinitum. 

Suppose further that each of 0, 1, 2, etc., is expressible in S at 
least indirectly, in the following sense: there is a standard method 
whereby, for each natural number 27 and each sentence ¢ of © in ‘z’ 
(i.e., having ‘x’ as sole free variable), a statement (or closed sen- 
tence) ¢; of S can be formed which is true (under adopted inter- 
pretations of the notations of ©) if and only if ¢ is satisfied by 7 as 
value of ‘x’. (If S contains explicit names of numbers, ¢; might be 
formed from ¢ by substituting the name of 7 for ‘x’. Otherwise ¢; 
might be taken as "(x)(¥ D ¢) 'where y is some standard equivalent 
of ‘x = 2’.) Let us then speak of ¢ as numerically general if ¢; is a 
theorem of © for each 7. 

Now we can say what it would mean for © to be w-inconsistent. 
It would mean that there is a sentence ¢ in ‘x’ which, through 
interpretations supplementary to those exploited above, is inter- 
preted as ‘x is a natural number’, and that there is a numerically 
general y such that "(4x)(¢y)1is a theorem of G. 
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It may be presumed that the choice in © of a translation ¢ of 
‘x is a natural number’ was not wholly capricious, and in particular 
we may reasonably require that the chosen ¢ be numerically gen- 
eral. (Under w-inconsistency, certainly, we can find numerically 
general sentences without resorting to sentences which are true of 
everything; witness y above.) Suppose then that © is w-inconsist- 
ent, i.e., that there are ¢ and y as described in the preceding para- 
graph, and that ¢ is numerically general. Then ‘dy ‘is also numeri- 
cally general, and hence not too restrictive to serve as a translation 
of ‘x is a natural number’. Moreover, '¢y' is more accurate than 
¢ as a translation of ‘x is a natural number’; for ¢ is true not only of 
everything that "¢y' is true of, but also of some things that "¢y" is 
not true of (since "(4z)(¢y) ‘is a theorem). It behooves us therefore 
to correct the translation in © of ‘x is a natural number’, to read 
"ow '; and then to re-examine © for w-inconsistency. 

Under the corrected choice of translation © may still be w-incon- 
sistent; for there may be a numerically general x such that '(4z) 
(¢vx) ‘is a theorem. If so, then we can further improve the transla- 
tion of ‘x is a natural number’ by taking it as '¢yx', and try again. 

This series of corrections may or may not terminate. If it termi- 
nates, it terminates in a numerically general conjunction ¢’ which, 
if taken as the translation in © of ‘zis a natural number’, leaves the 
system w-consistent. In case the series terminates, therefore, the 
w-inconsistency originally supposed is seen to be of no logical inter- 
est; it consisted merely in wanton choice of the broader ¢ instead of 
the narrower ¢’ for the reading ‘x is a natural number’. 

Surely then the only case of w-inconsistency worth studying is 
the case of non-termination. In this case let us call © numerically 
insegregative. The situation here is simply that for every numerically 
general ¢ there is a numerically general y such that '(az)(¢y) 'is a 
theorem of ©. Here the so-called w-inconsistency of © cannot be 
blamed on the wanton choice of a worse candidate ¢, instead of a 
better candidate ¢’, for the interpretation ‘x is a natural number’. 
It can still be blamed on a wantonness of interpretation, since if © 
is numerically insegregative we would do better to face the fact 
that ‘x is a natural number’ has no translation into © at all. The 
term ‘w-inconsistent’ is thus misleading also here, but at least there 
is in this case a logically significant situation which deserves notice 
under some better terminology. I have now proposed such a termi- 
nology: numerical insegregativity. 
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The property of numerical insegregativity may be restated in- 
tuitively thus: every sentence ¢ which 1s demonstrably true of all 
natural numbers (i.e., numerically general) 7s also demonstrably true 
of something else (i.e., ‘(4r)(¢y)' is demonstrable for some numeri- 
cally general Y). In such a system there is no proper translation of 
‘ce is a natural number’, but only an infinite series of better and 
better approximations (see next to last paragraph above). 

Note that a system © is numerically insegregative relatively only 
to an at least partial interpretation of its signs: normal interpreta- 
tion of the notations of truth functions and quantifiers, interpreta- 
tion of the universe of quantification to include all of 0, 1, 2, etc., 
and a determination in effect of the several numerals (sufficient to 
determine ¢; for every ¢ and 7). On the other hand the notion of 
numerical insegregativity does not, like that of w-inconsistency, 
depend on any preassigned translation of ‘xz is a natural number’. 

The primary interest of the notion of numerical insegregativity 
consists in this: as has been seen, so-called w-inconsistency reduces 
to numerical insegregativity the moment we repudiate those 
captious cases which depend merely on neglecting the best avail- 
able translation of ‘x is a natural number’. Insofar as the notion of 
w-inconsistency is found to be applicable beyond the bounds of 
numerical insegregativity, it can be made inapplicable by a better 
choice of translation of ‘xz is a natural number’ in the system under 
investigation. Insofar as the notion of w-inconsistency is interest- 
ing, the notion of numerical insegregativity is interesting—and less 
misleading. A numerically insegregative system is not absurd, or 
incapable of coherent interpretation, as the phrase ‘w-inconsistent’ 
suggests. It is merely uneven in its power of expression. It is too 
weak to express a necessary and sufficient condition for member- 
ship in the series 0, 1,2, . . . , but itis strong enough to be able to 
better each of its own approximations. 

The term ‘w-inconsistent’ has also another misleading connota- 
tion. On the analogy of genuine inconsistency, any system contain- 
ing an w-inconsistent system as a part would be expected to be 
w-inconsistent. Actually, as Rosser remarks (1952), NF might be 
w-inconsistent and yet a stronger system containing NF as a part 
might be w-consistent. He concludes that ‘‘such questions as 
whether a logic is w-consistent . . . are rather more subjective 
than objective,” but he might better have concluded that the 
terminology is misleading. It is obvious that a contained system 
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might be numerically insegregative and a containing system nu- 
merically segregative, since in the containing system there is more 
machinery for numerical segregation. 

At this writing some doubt remains as to the exact relationship 
between numerical insegregativity and the impossibility of an ad- 
missible translation of ‘x is a natural number’. Consider a system 
© and a sentence ¢ therein, with free ‘zx’. Relatively again to an 
assumed partial interpretation as of three paragraphs back, we 
might speak of ¢ as an admissible translation of ‘x is a natural num- 
ber’ if there is at least some way of so finishing out that partial 
interpretation as to cause ¢ to be true of all of 0, 1, 2, ... and 
false of everything else. Now I know of no proof that if © is nu- 
merically segregative (i.e., not insegregative) it is bound to contain 
an admissible translation of ‘z is a natural number’. What is obvi- 
ous is only the converse. 

Whether in particular NF is numerically segregative is an open 
question; and whether it contains an admissible translation of ‘z is 
a natural number’ is an open question. If ‘A « K’ can be proved in 
NF, then certainly the defining condition of K, viz.: 


(2) (y)Ocy. ()(zey.D.Szey) .D. rey), 


is not an admissible translation of ‘x is a natural number’; but NF 
might, even under these circumstances, contain a formula other 
than (2) which is an admissible translation of ‘x is a natural num- 
ber’. Some unstratified formula might serve as a translation of ‘z is 
a natural number’ without even guaranteeing existence of the class 
Nn of natural numbers. Two unstratified formulas which suggests 
themselves as possibilities are: 


reK .G(yeK.x2 Ky. (Az\(Aw(zer.z2z(\wey)) €2, 
(Gy)\(cey.yeSz. (z)zey.Diz2=0.V Gw)(z = Sw.wey))). 


at Element and Number 


1. Inconsistency of *200. Rosser pointed out (1941) that the sys- 
tem of my Mathematical Logic, when curtailed by leaving out the 
elementhood principle *200, would hold true even of a universe 
wherein the only entity is the null class. In other words, all theo- 
rems of the curtailed system remain true when all atomic formulae 
Tae B' are construed as false for all values of their variables. This 
observation constitutes a consistency proof of that curtailed sys- 
tem, in an extraordinarily strict sense; for it shows that every theo- 
rem of that curtailed system is of a very special kind such that if all 
its quantifiers be simply rubbed out and all its atomic formulae be 
marked ‘F’ then the whole will receive a “T’ under the ordinary 
truth-table computation. 

Thus, as I remarked in my review of Rosser’s paper, “‘the system 
exclusive of *200 is a completely safe basal logic, to which more 


This paper was called forth in 1941 by Rosser’s discovery that the system of 
set theory in the first edition of my Mathematical Logic was inconsistent. A 
makeshift repair was made in the remainder of that edition by means of a 
corrigendum slip, and incorporated also into the 1947 printing; but the de- 
mands of brevity precluded, there, any such broad revision as was set out in 
this paper. In 1950 Wang hit upon a different repair, nearer in spirit to the 
book; and it, rather than the present paper, set the form of the revised edition 
of Mathematical Logic in 1951 and later printings.—At points the present paper 
calls for consultation of Mathematical Logic, even invoking theorems of that 
book by numerical reference in proofs. These numbers refer to the first edition, 
but readers using the revised edition will have little trouble tracing the few 
discrepancies. Such dependence on outside material is not usual in the present 
volume, but it seemed allowable here partly because of the easy accessibility of 
Mathematical Logic and partly because of what can be gleaned from the paper 
without bothering with that book. The paper is reprinted from the Journal of 
Symbolic Logic, Volume 6, 1941. A pair of definitions in §6 has been corrected in 
a small way. 
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daring structures such as *200 may be added at the constructor’s 
peril.”” The review was written months ago, on the basis of galley 
proofs of Rosser’s paper. But the remark now takes on a prophetic 
ring, for news is forthcoming that the system including *200 is 7n- 
consisient. 

The contradiction is the Burali-Forti paradox, wherein it is 
argued that there can be no greatest ordinal number, and yet that 
the ordinal number of the series of ordinal numbers must be the 
greatest. The discovery that this paradox is implicit in Mathemati- 
cal Logic is due, again, to Rosser. He has carried through the formal 
derivation of the contradiction, a long and complicated matter, in 
clear and incontrovertible detail. 

The system that has thus been proved inconsistent embodied a 
synthesis of two features, whereof one came from von Neumann’s 
set theory and the other from my ‘‘New foundations.”’ The one 
feature is the recognition of non-elements; the other is the appeal 
to stratification. The synthesis was effected by taking stratifica- 
tion as a sufficient condition of elementhood; *200 says, roughly, 
that stratified functions of elements are elements. 

It is by no means clear that the contradiction infects either of the 
two antecedent systems. Von Neumann’s system is saved, pre- 
sumably, by the circumstance that many classes that come to be 
elements under the stratification principle *200 cannot be shown 
to be elements under von Neumann’s own elementhood axioms. 
For purposes of the present paradox, one crucial class of this sort 
would appear to be the class of all ordinal numbers; another, the 
relation of greater and less among ordinals. 

“New foundations,” on the other hand, is presumably saved by a 
different circumstance: the circumstance that stratification figures 
there as a condition of existence rather than merely of elementhood. 
Derivation of the paradox involves use of a certain class, called Q 
by Rosser,’ whose defining condition is unstratified. Since the use 
made of Q happens to be independent of any considerations of 
elementhood, the want of stratification is no protection for Mathe- 
matical Logic; whereas for ‘‘New foundations” this want of strati- 
fication effectively obstructs the proof that the required class Q 
exists at all. 

The particular obstacle that seems to bar the paradox from von 


11942, second proof of ¢841. Note that the A of Rosser’s first proof of {841 
likewise has an unstratified defining condition. 
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Neumann’s system is absent from ‘‘New foundations,” and vice 


versa. But Mathematical Logic lacks both obstacles, and the para- 
dox comes to be derivable. The same would seem to be the case, 
moreover, in the system of Principia Mathematica, so long as we 
take typical ambiguity seriously; i.e., so long as we leave the vari- 
ables without type indices and construe the theory of types merely 
as prescribing a stratification test for meaningfulness of formulae.” 
The system of Principia so conceived is not protected from the 
paradox in the way that von Neumann’s system seemed to be; in 
this regard the system of Principia behaves exactly like that of 
“New foundations.’”’ But neither is the system of Principia pro- 
tected from the paradox as was the system of ‘“New foundations” ; 
for it happens that the class Q alluded to earlier becomes for pur- 
poses of Principia a “heterogeneous relation” (relating consecutive 
types) whose defining condition zs stratified. Whereas in ‘‘New 
foundations” the expressions corresponding to heterogeneous rela- 
tions are short for unstratified class expressions, in Principia rela- 
tions have an irreducible status and may be heterogeneous without 
any hindrance from the stratification requirement. Thus it is that 
in order to avert Burali-Forti’s paradox the authors of Principia 
felt called upon to suspend typical ambiguity and introduce explicit 
type indices at the crucial point.* The mere requirement of strati- 
fication does not suffice to bar Burali-Forti’s paradox from the sys- 
tem of Principia, though sufficing to bar Russell’s paradox and 
the infinite bundle of related paradoxes.‘ 

Note that these remarks relate to Principia with its unreduced 
theory of relations, and not to that simplified variant of Principia 
wherein the theory of relations is reduced to that of classes.> That 
simplified variant enjoys the same protection as does ‘‘New founda- 
tions.” It is only in the ground-floor admittance of heterogeneous 
relations that Principia gains an added freedom demanding type 
indices for its control. 

Burali-Forti’s paradox has an analogue in Cantor’s paradox, 


2 The sense of stratification concerned here is more elaborate than in ‘New 
foundations” or Mathematical Logic, since in Principia relations are not re- 
duced to classes. 

POI. 3, 75, 80. 

*For an account of this bundle of paradoxes see §2 of my paper “‘On the 
theory of types.” 

® It is of the latter system that “On the theory of types’ treats. Thus the 
view, accepted in that paper, that the mere requirement of stratification 
suffices for the avoidance of paradoxes, is not controverted by the present 
remarks. 
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which says of cardinal numbers what the other says of ordinals. In 
Cantor’s paradox it is argued that there can be no greatest cardinal 
number, and yet that the cardinal number of the class of cardinal 
numbers (or, indeed, of the class V) must be the greatest. The 
obstacle to deriving this paradox, in von Neumann’s system, is like 
the obstacle to deriving Burali-Forti’s. In the system of ‘‘New 
foundations” and in the simplified variant of Principia noted 
above, the obstacle to Cantor’s paradox is again like the obstacle 
to Burali-Forti’s.6 In Principia proper, with its unreduced theory 
of relations, the two paradoxes again behave alike: Cantor’s, like 
Burali-Forti’s, is dodged only by recourse to explicit type indices.’ 

The parallelism between the two paradoxes breaks down, for the 
first time, in the system of Afathematical Logic. The argument that 
leads to Cantor’s paradox zs obstructed in that system; for the 
argument depends on showing that g(y C x) always has more 
members than xz, and it is obstructed by the fact that f(y C z) 
comprises in general only certain of the subclasses of x, viz., those 
that are elements. My own inattention to Burali-Forti’s paradox 
came, indeed, of confidence in the parallelism between that paradox 
and Cantor’s. 


2. Elementhood in Dribleis. The system of Mathematical Logic 
without *200 is, as observed, consistent. It is adequate, moreover, 
to a substantial portion of the material covered in the book. Most of 
the theorems and metatheorems survive, with proofs unchanged, 
right up to the end of Chapter [V—hence to the beginning of rela- 
tion theory. Chapters I-III, of course, survive intact. 

Let us keep to solid rock while we can. Accordingly, before we 
consider bringing in any new axioms of elementhood to supplant 
*200, let us suppose Chapter IV carried over with mere omission of 
such few theorems and metatheorems as depended directly or in- 
directly on *200. These turn out to be nineteen in number: {210- 
7212, 1241, *253, *254, *256-*259, [261, *263, 1272-274, {359- 
1361, and {363. Nearly all of these are concerned explicitly with 
elementhood. 

The proof given in the book for Case 2 of *221 makes use un- 
necessarily of *200 (via {212), and should be supplanted by the 
following new proof: Let x be the formula just after the occurrence 

Concerning the solution of Cantor’s paradox for the system of ‘‘New 


foundations’ see my paper ‘“‘On Cantor’s theorem.”’ 
7See Principia, Vol. 2, 31. 
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of "(¢)'in question, let 6:1, . . . , Bm be the variables of x, and let 
7 be new. Form x; from x by putting y for the first 2 free occurrences 
of ¢. Suppose ¢’ is not ¢ (for otherwise -'221' by *100). Since ¢ is 
one of am, . .. , @n (cf. old proof), and similarly for ¢’, 


i617 39 L221 C7) =e 


#104 > ()( = y)' 

*291, Case 1 >. (xa = (Yxs" (2) 
for each z from 1 to the number k of free occurrences of ¢ in x. Thus 
*100 a re k .) L221 .D. x = (ox! 

*118, *123 >. (x =xe' (ke +1) 
#104 Eye += 1 4 zat Gx =x". (k + 2) 
Similarly -E'L221 >. (tx =x! (k + 3) 


*100, *117 t'L221 5 (6)... (Br)(kK+2.k4+3.D,] 
Ox =x)" 


7121 >.¢= ¢'1 
Then we may proceed, in an addendum to Chapter IV, to “more 
daring structures . . . at the constructor’s peril.’”’” We may add 


axioms of elementhood adequate to those nineteen postponed 
theorems and metatheorems and adequate also to the succeeding 
chapter on relations. Actually the only ones needed are these two: 


1400.8 (x) weV 
401. (x) (y) &, yeV Dea riyey 


From these the five theorems hitherto known as {211, {274, {210, 
$272, and {273 are respectively derivable as follows: 


1410a.° AeV 
Proof. +400 wa(zexeV (1) 
7343 (260 =.] .f#(@éx) = A (2) 
“223 [2.1 =) 410a 
14100. (@) ceV.=.feV 
Proof. {401 ZceV . ae njteV 
1276, *224 ». oe V (1) 
wl (al Dita Vv . fe 


8T shall use italics to distinguish newly applied reference numbers from 
reference numbers whose application remains as in the book. 

* This and its sequel are called +4/0aff. rather than {402ff. in view of Mathe- 
matical Logic, 90n. But note still that {410 is to be understood as coming after 
all these. 
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$275, *224 >.2eV (2) 
*100 [1 .2.D] 410 
1410c. VeV 
Proof. +293, *224 VeV[.= 410a] (1) 
+410b 410c [= I] 


1410d. (x) (y) L,Y ev ame 2 (ey cay 
Proof. +410b, *123 tye V Det) geV 


+401 >.ENGEV 
+275, *224 D.rNyeV 
+4106. (x) (y) z,yeV 5.2 \P yeV 
Proof. 281, *224 zyeV.D.rUyeV([:= 401] 
+410b Dre ye 


*259, moreover, is derivable from {410b in obvious fashion: 
“UL Of tape V.=.a~de V1. 


Proof. +410b (& D20) Hage V. 
*245, *224 


From among the nineteen theorems and metatheorems enumer- 
ated earlier, seven have now been covered: {210, 1211, *259, {272- 
+274, and {359, which last is {400 itself. The other twelve were 
proved in the book mediately on the basis of these seven; they may 
be renumbered as {41/0g-{410r, and their old proofs may now 
simply be reproduced in the new setting. Moreover, the succeeding 
Chapter V, on relations, remains intact (except insofar as occasional 
references in proofs have to be renumbered in the light of the fore- 
going rearrangement); for it happens that *200 is nowhere directly 
cited in Chapter V, all use of *200 being mediated rather by three 
({211, 7859, T3860) of the derivative theorems that have already 
been provided for. 

The whole of the book, therefore, up to the beginning of arith- 
metic, remains essentially unchanged when *200 is supplanted by 
its two meager instances ¢400-+401. The only effect is to shift and 
renumber nineteen theorems and metatheorems and to re-prove 
a few as above. Yet +400-{401 say no more than that unit classes 
(genuine and vacuous) are elements and that Boolean functions of 
elements are elements. 

When we push on to the matter of natural numbers, in Chapter 
VI, we do need a third axiom of elementhood, viz., the instance of 


. A(a édd)e V' 
-aw~ge V4 
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*200 designated in the book as 610; but no more than this, for it is 
only here that *200 is cited in the course of Chapter VI. Recon- 
strued as an axiom, {610 might be called f600. 

It seems advantageous to adhere thus to a minimum of element- 
hood axioms adequate to purposes at hand, rather than running the 
risk of something like *200. Then as we proceed to higher branches 
of theory we can set down supplementary axioms of elementhood 
explicitly as needed. We can do this with the realization that the 
ensuing theory is tenuous to the extent that those special axioms of 
elementhood are tenuous. Economy in the elementhood axioms 
governing a given portion of theory becomes a virtue to be striven 
for, providing as it does a measure of the soundness of the theory. 
The most solid part of class logic, thus, comprises everything up 
through the newly expurgated Chapter IV. The next most solid 
part comprises the addendum to Chapter IV, together with Chap- 
ter V; this theory, which is primarily the theory of relations, is 
tenuous only to the mild degree measured by the special element- 
hood axioms +400-{401 (or by still weaker axioms, if such prove 
adequate; see below). The next degree of tenuousness is exhibited 
by natural arithmetic (unless the special elementhood axiom {600 
for this domain proves dispensable after all; see below). As we pro- 
ceed into more and more remote and speculative branches of mathe- 
matics, we have before us always a growing record of our sins; and if 
our sins catch up with us we can repent them by degrees, prudently 
withholding any penitence in excess of what the occasion demands. 

Such wary postulation of elementhood piecemeal would seem 
indeed to be the procedure dictated by the scientific temper, once 
we have cast loose from patently consistent domains. And this pro- 
cedure fits with the familiar old postulational attitude toward 
mathematics, as opposed to the idea that the whole of mathematics 
is theoretically reducible to a bedrock of logical truisms. Reduci- 
bility to logical notation remains a fact; but, at the latest, the 
truism idea received its deathblow from Gédel’s incompleteness 
theorem. 

Gédel’s incompleteness theorem can be made to show that we 
can never approach completeness of elementhood axioms without 
approaching contradiction. For if every statement of elementhood 
were demonstrable or refutable then every logical statement would 
be demonstrable or refutable, since 


E'd =. A(aea.D >) € V' where a is not free in ¢. 
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Proof. +260 tL a(aéa) eV" (1) 
*100, *117 (& hp) E'~¢D (a)\(aéa.=:aea._ ¢)' 
VAL >:[1 =Ja(aea.D gd eV" (2) 
*100, *117 (& hp) +'¢D (a)\(aea._D ¢)" 
*193 _). Gave De) = Vv" 
*228 D:A(aea.D 6) eV [.= 410c]'. (3) 
*100 H'3.2.D:16 =. &(aea.D oe V' 


The growing list of elementhood axioms traverses the interval 
between the patent consistency of Mathematical Logic without *200 
and the inconsistency of Mathematical Logic including *200; and as 
it grows it provides for progressively more tenuous branches of 
mathematics, from natural and rational arithmetic through real 
arithmetic, theory of functions, infinite cardinal arithmetic, infinite 
ordinal arithmetic, and worse. 


3. Elementhood by Enumeration. Further investigation reveals 
that even the axioms {400-{401 are more powerful than need be 
for purposes of the topics covered in the book. We could supplant 
1401 by the weaker statement +41/0e without any real detriment to 
the sequel. Of the nineteen theorems and metatheorems in Chapter 
IV that depended originally on *200, ten do indeed drop when {401 
is weakened to t410e; but these ten are pretty readily dispensable. 
They are 210, {272, and {274 (i.e., -410b-d above), together with 
+241, *256, *258, *259, +261, *263, and +363. 

Under this improved theory, accordingly, the addendum to 
Chapter IV would consist of the axioms: 


+400. (x) weV 
1401’. (x) (y) LENA. cw TeV 


and a sequence of theorems and metatheorems answering to such 
others of the old nineteen as survive when the ten last enumerated 
are deleted. 400-1401’ answer to the old {359 and {273, and the 
following answer respectively to the old {211, 1360, 1361, *257, 
*253, *254, and +212. 


+410a. AeV Proof given above. 
+4100’. (x) (y) wzUlwevV 
410c'. (x)(y) wm (\yeV 


+*Z1Oe E'(a)~¢ D>. Ade V'. 
Proofs of +410b'—*410d’ like those given in the book under {360, 
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7361, and *257 (but with ‘359’, ‘273’, and ‘211’ changed of course to 
‘400’, ‘401"’, ‘410a’). 
*410¢’. If ais not free in ¢, H+'€d = V.= o'. 


Proof. *231 (& hp) E'(a)\(aeV ._> ¢) D. [410a D] 6". (1) 
*100, *117 (& hp) /t'¢?D (a)(aeV.D 6)" (2) 
*100 -(249.1.2.>] 410e"". 


*410f". If a is not free in ¢, FH '€d = A. =~". 
Proof similar, using *250 instead of *249. 


14109’. (y) (ax) (x # y) 
Proof. *231 (2)(2 =y)D.4@ex) = y 
"223 _):(260 =.]yéeV (1) 
“231 (zj@=y)D.k=y 
“228 >:[410a =.JyeV (2) 
*100 Ss (1.2. dD] ~@)(e = y) (3) 


*130 (3 =] 4109’ 


Though {401’ asserts far less than ¢401, the theory founded on 
+400-+401’ is quite adequate to relation theory. Chapter V remains 
intact, except of course for change of the few occurrences of ‘211’, 
‘359’, and ‘360’ in proofs to ‘410a’, ‘400’, ‘410b”’. For it has already 
been remarked that the three theorems thus designated constituted 
the only channel through which Chapter V drew on *200. 

And it turns out that the meager basis under consideration will 
suffice even for the arithmetic of natural numbers, provided that 
we alter our definitions to suit. This new theory of natural numbers 
will be set forth in the next section. It is only when we proceed to 
derive theorems in higher branches, e.g., in the theory of real num- 
bers or in that of infinite numbers, that we have to add another 
axiom or so of elementhood. 

1400-+401' do not provide infinite elements, but they guarantee 
that every finite class is an element. Any class of n members, say 


(i) a mr... WU &s, 


is shown to be an element by iteration of the process that leads 
from {400-T401' to +410b’. Whenever a class is specified by enu- 
meration of its members, it is thereby shown to be an element under 
1400-{401’; for to specify a class by enumeration is to specify the 
class by an expression of the form of (i). 
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The need of adding an axiom or two providing for infinite ele- 
ments arises only when we proceed to derive theorems in branches 
of mathematics beyond the scope of the theorems actually pre- 
sented in the book; e.g., when we turn to the theory of real num- 
bers or to that of infinite numbers. A boundary thus appears which 
is suggestive of the old distinction, drawn traditionally in a pretty 
vague way, between the actual infinite and the potential infinite. The 
axioms of elementhood 400-401’ might be said to posit the poten- 
tial infinite only, in that they guarantee infinite classes but not 
infinite elements; whereas an axiom such as ‘w e V’ (see D39 below) 
posits an actual infinite, viz., an infinite element. 


4. Natural Numbers as Counter Sets. A revised theory of natural 
numbers that avoids any elementhood assumptions beyond +400 
and 7401 is obtained by construing the natural numbers as classes 
of the sort that in the book were called counter sets.° Thus, we 
define the successor relation § in the way that the self-augment 
relation Sa was defined in the book, viz.: 


D36. S’ fon Az(o.) we)’. 


The number 0 becomes the null class. We may think of the old 
definition D16, accordingly, as elaborated thus: 
D16. ‘A? or 0’ for ‘t(a ¥ 2)’. 
The sign ‘A’ might well be dispensed with in favor of ‘0’ from the 
start, in reversion to Boole’s original notation; but for the present 
I am keeping both signs so as to avoid needless departure from the 
style of Mathematical Logic and from current practice generally. 
Let us merely think of ‘A’ and ‘0’ hereafter as if they were one and 
the same sign. I shall supplant the one by the other at will without 
even citing D16. 

The numbers 1, 2, etc., are defined in obvious fashion: 


D3?7. ‘lV’ for (50), 
Ds. ‘2? for ‘(S‘1)’, 
etc. The class of natural numbers is defined in terms of ‘0’ and ‘8’ 


as before; but I shall use ‘w’ now rather than ‘Nn’ to designate this 
class, for the class will subsequently turn out to be the same thing 


10 Von Neumann has construed the finite ordinals in this way. See below, §6. 
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as the ordinal number w. Thus 
D39. ‘w’ for ‘(S10)’. 


The old theory identified each natural number n with the class of 
all n-membered classes; the new theory, on the other hand, identi- 
fies n rather with a certain standard n-membered class. On the 
score of naturalness, the theories have about equal claims. Insofar 
as one thinks of a number n as a property of aggregates, viz., the 
property of n-foldness, the old theory is sustained. But the new 
theory is sustained insofar as one thinks of a number 7 rather as a 
certain comparative standard of multiplicity used in counting n 
objects. What we correlate with n objects in counting them, 1.e., 
what we mention as we point to the successive objects, are the first 
nm numbers; and thus there is some appropriateness in construing a 
number 7 as the class of the first n numbers themselves. The ap- 
parent circularity here is resolved merely by admitting 0 among the 
first n numbers; thus n ceases to be a class of m numbers including 
itself, and becomes rather the class of the nm antecedent numbers. 
Once we have gone this far in characterizing the natural numbers, 
the identification of natural numbers with counter sets is inescap- 
able. If 0 is to be the class of all earlier natural numbers, then 0 is 
clearly A; if 1 in turn is to be the class of all earlier natural num- 
bers, then 1 must be .0, i.e., A; if 2 is to be the class of all earlier 
natural numbers, then 2 must be 10 U tl, i-e., eA U wA; and so on. 

This theory emphasizes the analogy between natural numbers 
and other standards of measurement. A natural number is a stand- 
ard of comparison for measuring multiplicity just as the Paris 
meter stick is a standard of comparison for measuring length. It 
just happens that measuring sticks for multiplicity are definable 
within logic whereas measuring sticks for length are not. 

Whereas application of natural numbers in the old sense pro- 
ceeded by membership, application of natural numbers in the new 
sense proceeds rather by correlation. To say that x has n members, 
under the old theory, we said that « en. Under the new theory, on 
the other hand, we say rather that the members of x correspond in 
one—one fashion to those of n; or, what is equivalent under the new 
version of natural number, that n is the class of all those natural 
numbers that have fewer members than has x. This notion of hav- 
ing fewer members will be defined in §6. 

But meanwhile, independently of that development, we can 
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attend to the pure arithmetic of natural numbers. The first two 
theorems in this direction serve the purpose now of the old {610 
and 614, and will be numbered accordingly. 


7610. (x) eV .D.gUweV 
Proof. 401’ xeV{.400]).D.2 UireV 
1614. @) 26 VebDasin= 2 Wee 


Proof from {610 like that of the old 614 from 610. 


Thence we have this corollary: 


1616. 1= 0 
Proof. 1614 (& D37) [410a D.J1=0U0 
1297 =F 
The following theorem is the analogue of the {624 of the book. 
1624. (x) zeV .=.S‘c #0 
Proof. 7317 Cel iz (1) 
#223 SU ie=0.531 =) Co 
$336 >. = A (2) 
4433 Sa2=2Uw.D.8% =0.5.4c=— aia 
(3) 
7534 ZEV .D 52 —0 (4) 


*100 [38.4.614. 343 .D] 624 


Nine further theorems and metatheorems are the {617 and 7630- 
*637 of the book, notationally unchanged except of course that ‘Nn’ 
becomes ‘w’. The proofs remain as in the book, except for the obvi- 
ous adjustments of references: 610, 1611, D39, +615, f616, and 
D40 give way respectively to 610, 410a, D36-39. 

The remaining crucial theorem to derive, for arithmetical pur- 
poses, is the theorem {677 to the effect that no two natural num- 
bers have the same successor. The following three theorems are 
lemmas for it. 


1568. i zteV .D.x =Gr 
Proof. +564 yex.ceV.= Qly, 2) 
7479 =, yee a (1) 


“HOO, “U7 rey nm) (y)((1 =e =.ye Qz), 
q.e.d. (cf. D10). 
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The converse of the above could also be established, but there is 
no need of that for present purposes. 


1676. (x) zeV .-. GS*'r) = (4%r) Use 

Proof. +472 G“(¢ U ix) = (Ex) U (Cex) (1) 
*223 a= OC am. Kea Je) =] (Ce Usl.= if (2) 
7703 BPaS2Ww.D:.2 = Ea .D R67 [(:= 7] (3) 


*100 (8. 568". 614.. W676 


The following theorem plays the role of the old 676, showing as it 
does how any natural number can be expressed as a function of its 
successor. 


t676. G) rew.).£ = (SZ) 

Proof. 277, *227 eWae=2U02Ve@ (1) 
*993 «= R6VE DD: [1 =J2VU we = (482) UzVUe (2) 
4223 nolia DithG.ooD.60=—(Cem) YU Sars=2) ©@ 
1536 S‘reV (4) 
7675 = [4 D.] E*G*G%z)) = (E"(S'z)) U Sz) (5) 
*223 lp omy R64 D: R676 D8 = GS ez 2} ©) 
+190 DOO.) eV. (7) 
*100 (6.7.614 J2ew. R676 .D.S‘s = EG (S8'(S%s)) (8) 
1568  [410a D.]0 = @*40 
1615(& D37),*227 = @*(S‘0) (9) 


*637 [(2)8.9 .]Jxew._ R676 


From {676, finally, {677 can be proved precisely as it was proved 
from {676 in the book. And now all the definitions and theorems 
and proofs of the succeeding §§47-49 of the book, on powers of rela- 
tions and the algorithm of arithmetic, can be carried over without 
any changes whatever except for putting ‘w’ for ‘Nn’ and using 
1410a, D37, and 1624 (with its minor complication) instead of 
1611, 615, and 624. 

Note incidentally that the theory of greater and less among num- 
bers is very much simplified by the present approach; for, where x 
and y are numbers, x S y if and only if x C y. We can think of 
‘cs S y’ in general as meaning ‘x C y’, since this reproduces the 
customary arithmetical sense so long as x and y happen to be num- 
bers.” Thus the laws: 


11 The identification of ‘S’ with ‘C’ is suited to the arithmetic not only of 
natural numbers but of cardinals and ordinals generally (cf. §6 below), and 
even of real numbers—both when real numbers are taken as Dedekind segment- 
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(x) Ons. 

(x) Le 
(x)(y) C= Y=. FSy.y =2, 
(x) (y) (2) ia yeyese 1 = 2 


appear as mere transcriptions of the general laws 1334 and {310- 
7312 of class algebra. Again the law: 


(x) eV ..): aoe 


appears as a corollary of {614 and {317. The idea might be enter- 
tained of using ‘S’ instead of ‘C’ in the general class algebra from 
the start; this would be a reversion to the old notation of the logical 
algebra, quite like the suggested restoration of ‘0’ in place of ‘A’. 
Similarly ‘x < y’ could be given the general sense of ‘¢ Cy. 
xx y’, 


§. Functional Aspects. As a means to formulating the idea of 
having n members (see above), and as a means equally to develop- 
ing a theory of infinite cardinals and ordinal numbers, we need the 
notion of one-one correspondence between the members of two 
classes; or, what proves to be just as adequate and more simply 
manageable, the notion of there being fewer members in one class 
than another. Treatment of this notion is facilitated by the intro- 
duction of a preliminary notion that I call functional aspect—a 
notion that I regard as of some logical interest also apart from its 
immediate use in this paper. 

Membership is, in an obvious sense, inverse to class abstraction. 
Thus we have: 


Hae V .Diacdd.= |. (*230) 


Correspondingly relational predication is inverse to relational ab- 
straction: 


L'a, yeV .. afg(a, vy) = ¢$". (433) 


classes of ratios (cf. Mathematical Logic, 271f.) and when they are taken rather 
as relations @ “‘x such that x is a Dedekind segment-class of ratios (cf. Principia 
Mathematica, Vol. 3, 336). The identification of ‘S’ with ‘C’ is suited also 
to the arithmetic of ratios, if we merely reconstrue the ratios cumulatively; 1.e., 
if in the first definition of §50 of Mathematical Logic we change ‘=’ to ‘<’. 
This cumulative version of ratios is advantageous also on other grounds; for it 
causes the ratios to be identical with the rational reals, once the relational 
version of reals mentioned above is adopted. The reals come to constitute a 
genuine extension of the ratios. 
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Correspondingly, again, functional application is inverse to func- 
tional abstraction: 


Ela, feV haigta Se (*541) 


To an important degree, moreover, these relationships of inverse- 
ness also hold in the opposite direction. Thus 


Gyex) =x (7189) 
whenever z is a class (which it necessarily is'”); again 
Gary, z)) = 2 


whenever the z is a relation; and finally, as will be proved in {574 
below, 


Ay(x‘y) = x 


whenever the z is a function— if by a “function’”’ we understand for 
the present a single-valued function defined for all arguments, or 
in other words a relation x such that rz = V. 

In view of the second of these last three principles, it was con- 
venient to abbreviate ‘j2(x(y, z))’ as ‘z’. In view of the last of the 
three principles, similarly, it will be convenient to abbreviate 
Ay(a‘y)’ say as ‘fx’. 


D33a. fee” sor "Xoeay". 


Just as ¢ is a relation in any case, and is the same as x if x is a rela- 
tion, so fx is a function in any case, and is the same as z if ris a 
function. Thus, just as ‘%’ served virtually as a simple relation 
variable, so ‘x’ will serve virtually as a simple function variable. 

Whereas z is called the relational part of x, we cannot speak of 
fx as the functional part of x; for ¢x is not in general included in z. 
Rather, ¢x proves to be a relation comprising all those pairs z;w 
such that z(z, w) and we rz, and in addition all those pairs A;w 
such that wetrz. Deprived thus of the term ‘functional part’, I 
have chosen to call ¢x the functional aspect of x. 

Parallel to the theorem: 


(x) (y) ) tly, 2) = xy, 2) (7436) 
concerning relational parts, we have this concerning functional 


aspects: 


12Cf. Mathematical Logic, 135. 
13 Tbid., 204 ff. 
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1570. (x) (y) ra’y = xy. 
Proof. *541 (& D33a) ye V |. 586] .D. ey = Frey (1) 
1534 eV . apt — A 
“223 D.9ée€V ..29 = ey (=e) (2) 


*100 (1.2. >.)aty = eae 


So we can agree to drop the digamma automatically from contexts 
of the form ‘¢f‘n', just as we agreed" to drop the dot from '&(n, 6)". 
In particular, ‘¢¢2’ gives way thus to ‘rz’, just as ‘z’ gave way to 
we. 

The next two theorems are two ways of saying that ¢r and x 


coincide for arguments in rz. 


t571. (x) (y) ) yeu... rx, y) = xz, y) 

Proof. *540 (& D3&3a) fu(z, y) =. ye V [. 536]. 2 = xy (1) 
+101 R531 5: [1 =:] ee, y) =. ye Vi ci) 
*100 (& D24) (531 .2.D)] 671 
1572. (w) (2) C.D. pe w= 2 


Proof. *100; *117 
yew... yerx :_ (z)((671 D:)] px, y) . yew .=. x, y). yew) 
(1) 
*102 (& D21) [a)LJ we C wD @)) 
*186 (& D28) Dp = aw 


That ¢z is a function in any case, and coincides with x whenever 
x is a function, is recorded in the next two theorems. 


1573. (x) tae = V 
Proof. *193 [(y)586 DD.) ¥ = way € V) 
*543 (& D38a) = Gre 
fo74.) (x) fe= 2 =e = WV 
Proof. *223 -x==2. love =e = (1) 
*223 re = V.D. [(z)(y)671 =)@)(y) ye V .D R571) 
*100 (& D24), *123 > (2)(y) R571 (2) 


+447, *123 (& D5) 574[=.1.2] 


I have set forth the general theory of functional aspects at this 
length rather out of consideration of the general utility or interest 


1 Thids, p. 205. 
4 Substantially this principle is mentioned on p. 227n, ibid. 
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of that new notion than by way of groundwork for the brief re- 
mainder of the present paper; for the only use that I shall make of 
that notion here is in giving a somewhat simpler form than other- 
wise to a couple of definitions. 


6. Ordinals and Cardinals. A class x is said to have no fewer mem- 
bers than a class y if the members of y can be exhausted by assign- 
ing each to one or more members of x without ever assigning two 
members of y to a single member of x; in other words, if y consists 
of values of some function for arguments belonging to 2; in other 
words, if (3z)(y C rex). Thus x has fewer members than y just in 
ease (2) ~(y C rex). The notation ‘xr < y’, read ‘x has fewer 
members than y’, is accordingly introduced in the following fashion: 


Fo < 9)" for “(a) ~(@ C raf)". 


As remarked in §4, x will have z members, for any natural num- 
ber z, just in case z is the class of all those natural numbers that 
have fewer members than x. In other words, where x is any finite 
class, the number of members in x is J(yew.y < x). But where x 
is an infinite class, the “number of members in x” in the sense just 
proposed reduces to w itself; for where zx is infinite the clause 
‘y < x’ is fulfilled by every natural number y. The proposed sense 
of “number of members in x” thus works properly only for finite x; 
it ceases to discriminate sizes of classes when the classes are infinite. 
This is of course to be expected, since we have been dealing so far 
only with finite numbers. So let us turn our attention now to 
infinite numbers; for after we have considered these, the above 
formulation of “number of members in x” will turn out to admit of 
immediate extension so as to accommodate finite and infinite classes 
equally well. 

The series of ordinal numbers can be taken as the obvious exten- 
sion of the series of natural numbers. The finite ordinals can be 
identified with the natural numbers themselves; the first infinite 
ordinal, next, generally referred to as w, can be taken as the class w 
of all natural numbers; the succeeding ordinals w + 1, w + 2, etc., 
can be explained as S‘w, S‘(S‘w), etc., i.e., as ow U w, wo U w U 
u(w U w), etc.; 2~ becomes the class of all these that have gone be- 
fore, hence w U (,S“‘tw); 2w + 1 is S‘(w U (,8“uw)), and so on.! 
Each ordinal, finite or infinite, is the class of all its predecessors. 


16 Such is von Neumann’s way of construing the ordinals in his 1923 paper. 
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The class of ordinals turns out to be definable in a fairly simple 
way: 


‘NO’ tor “z(y) (Sy Cy. @)@ C yee Gee y) Dawe 


NO is of course the class only of those ordinals that are elements; 
and, whereas all the finite ordinals are elements by virtue of being 
finite classes, the question how far elementhood extends into the 
infinite ordinals is one that depends on adoption of further element- 
hood axioms according to taste or to special purposes at hand. 
Note, though, that at most one ordinal, in the sense informally set 
forth in the preceding paragraph, can fail of elementhood; for an 
ordinal is the class of all its predecessors, and hence has only ele- 
ments as predecessors. So, however far the series of ordinals that 
are elements is assumed to extend, the very existence of ordinals 
must be reckoned as leaving off after one more step. The ordinals 
that are elements are the members of NO; the ordinals in the 
broader sense are the members of NO and NO itself. 

Now we are in position to extend our earlier formulation of ‘the 
number of members in 2’ so as to allow for infinite x. We have only 
to use ‘NO’ where we previously used ‘w’, thus: f(ye NO. y < 2). 
The number of members of 2, or, as it will hereafter be called, the 
cardinal number of x, is thus explained as the class of all ordinal 
numbers that have fewer members than z. 


"et for "a(aeNO.a < £)4 


The class of all cardinal numbers may thereupon be explained as 
£(ay)(« = ney); or, what is equivalent, 


“NiC’ forté(a= nex)’. 


A cardinal number is, according to these constructions, an ordinal 
number; for a cardinal, like an ordinal, has as members all and only 
the ordinal numbers up to some point. But still a cardinal is an 
ordinal only of a special sort. As is evident from the definition of 
"net! the distinguishing characteristic of cardinals is this: a 
cardinal will never have one ordinal z among its members unless it also 
has among tts members all other ordinals that have as few members as z 
has. I.e., since the members of an ordinal (and, in particular, of a 
cardinal) are simply all the preceding ordinals, a cardinal will never 
have one ordinal z among its predecessors unless it has also among tts 
predecessors all other ordinals that have as few predecessors as z has. 
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In short, a cardinal is any ordinal that has more predecessors than 
has any of its predecessors. 

Thus in particular the finite cardinals are the same as the finite 
ordinals—and hence the same, again, as the natural numbers. The 
first infinite cardinal, commonly called No, is the same as the first 
infinite ordinal w. But the second infinite cardinal, commonly 
called Ni, is not the same as the second infinite ordinal S‘w; for the 
predecessors of S‘w are denumerable and hence no more numerous 
than the predecessors of w. The first ordinal whose predecessors are 
more than denumerable is the one commonly known as a; and it is 
with this that &: comes to be identified. Similarly Ne is we, Nu iS wa, 
and so on. A cardinal number 2, in general, is identified with the 
first ordinal that has x predecessors. 

The class of ordinals NO, the class of cardinals NC, and the 
cardinal number of z have all been defined. Let us turn, finally, to 
the definition of the ordinal number of x, where x is thought of now 
as a well-ordered series. Just as the cardinal number of «x is ex- 
plained as the class of all ordinals y such that y has fewer members 
than xz, so the ordinal number of x may be explained as the class of 
all ordinals y such that the series of ordinals up to but excluding y 
is a shorter series than x. Le., 


nor = (ye NO. suby << 2), 


where suby is yet to be defined as the series of ordinals up to but 
excluding y, and ‘<<’ is to mean ‘is shorter than.’ 

If we identify a series with that relation which (intuitively 
speaking) each element of the series bears to itself and to each sub- 
sequent element of the series, a suitable definition of ‘suby’ is 
apparent; for 


suby = 24(z2, ce NO.2 S$ 2.4 < y). 


But, where z, x, and y are ordinals, ‘z Sx2.x < y’ amounts 


merely to ‘zC x.xey’; and, where y is an ordinal, the clause 
‘2, « NO’ amounts to ‘zey’ alongside ‘z Cx.xzey’. Thus we 
might explain suby in very general and elementary terms, for any 
class y, as the relation 24(z2 C x.z2, vey); i.e., simply as the sub- 
class relation among members of y. Then suby, so defined, becomes 
in particular the series of ordinals up to but excluding y so long 
as y happens to be an ordinal. As to ‘<<’, a definition that proves 
to give the intended sense so long as x and y happen to be well- 
ordered series consists in explaining ‘x << y’ as short for: 
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(2) ~Yy C re |2| “r2). 


Our trio of definitions is accordingly as follows: 


"subt! for "Af(a C y.a, yet)|, 
"<< 9) for™@) Gi Fe |F | a)", 
Tot" for "@(a eNO. suba<< £)". 


Note incidentally that an expression for the series of ordinals, as 
distinct from the class of ordinals, is ready to hand in ‘subNO’; and 
the series of natural numbers is subw, and that of cardinal numbers 
is subNC. 


The revision of Mathematical Logic that is called for by the in- 
consistency of *200 involves nothing of what has been set forth in 
§$3-6 of the present paper. The essential revision consists rather in 
merely changing Chapter IV as explained in §2 above, and recon- 
struing 7610 of Chapter VI as an axiom {600. Correlative changes 
elsewhere in the book are slight and obvious: deletion of the Ap- 
pendix, simplification of §58, and renumbering of any cross-refer- 
ences to the few renumbered theorems. What has been set forth in 
§§3-6 above is then an alternative development, appropriate per- 
haps as an appendix. 


XL 


«t On an Application of 
Tarski’s Theory of Truth 


Consider two interpreted systems of notation, L and L’, in which 
statements can be formed. Let L’ contain L together with what I 
have called the protosyntax of L;' i.e., the elementary means of 
talking about the expressions of L. For certain such systems L and 
L’, we know from Tarski’s work? (familiarity with which will not, 
however, be presupposed here) how to define truth for Z in L’; 1.e., 
how to translate ‘y is a true statement of L’ (with ‘y’ as a variable) 
into a formula of L’. 

But if L contains adequate notation for elementary number 
theory, and L’ consists of Z and its protosyntax and nothing more, 
then definability of truth for Z in L’ leads to paradox. For, Tarski 
has shown? how to derive paradox from definability of truth for L 
in L; and L’ does not essentially exceed L, since protosyntax can be 
reconstrued as elementary number theory by Gédel’s expedient of 
assigning numbers to expressions.’ 

Hence if in particular the system of Mathematical Logic is con- 
sistent, Tarski’s method of defining truth for ZL in L’ must some- 


how break down when L is taken as the logical notation of Afathe- 


The content of this paper had been figuring in my seminar for some years when, 
early in 1952, I was prompted by outside inquiries to write it up for the Pro- 
ceedings of the National Academy of Sciences. It is reprinted here by permission. 
This paper, unlike the preceding one, requires no consultation of Jfathematical 
Logic. 

1 Mathematical Logic, Chap. 7. 

2“Der Wahrheitsbegriff.”’ 

>See, e.g., Mathematical Logic, 313ff. 
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matical Logic and L’ is taken as just L plus the protosyntax of L. It 
is the purpose of the present paper to follow out the details of this 
breakdown. An outcome of the inquiry will be the discovery of a 
recursive definition which cannot, within the logical resources of 
Mathematical Logic, be turned into a direct definition (if the sys- 
tem is consistent). 

So suppose henceforward that L is the logical notation of Mathe- 
matical Logic and L’ is L plus the protosyntax of L. In L there is an 
infinite alphabet of variables, say ‘s’, ‘’, ... , ‘2’, ‘s’”, ete. Sup- 
pose x is a function, or one—many relation, which assigns an entity 
to each variable. In Tarski’s terminology, x is said to satisfy a 
formula y of L if y comes out true for the values of its free variables 
which are assigned to those variables by z. Vacuously, then, if y is 
a statement (hence devoid of free variables), y is satisfied by every 
function x or by none according as y is true or false. So, if we can 
define satisfaction, we can define ‘y is true’ as ‘Stat y . (x)(x sat- 
isfies y)’ where ‘Stat y’ means ‘y is a statement’. We know how to 
define ‘Stat y’ in the protosyntax of L;* so the problem of defining 
truth for L in L’ now reduces to the problem of defining satisfaction 
for Lin L’. 

The atomic formulas in the notation of LZ consist of the ‘e’ of 
membership flanked by variables; and the other formulas of Z are 
built from atomic ones by joint denial (‘ |’) and universal quanti- 
fication. Let us refer to the atomic formula whose respective vari- 
ables are y and z as y e z; let us refer to the joint denial of the re- 
spective formulas wu and v as u j v; and let us refer to the universal 
quantification of v with respect to y as y qu v. Thus if y is ‘w’, zis 
‘vy’, wis ‘xet’, andvis ‘(tew | tew)’, then yez is ‘wex’, ujvis 
‘(cet | (ew | tew))’, and yquy» is ‘(w)(tew | tew)’. We 
know how to define the notations ‘y e 2’, ‘uj v’, and ‘y quv’ in the 
protosyntax of L.§ 

To say that a function x satisfies an atomic formula of member- 
ship is to say that what z assigns to the left-hand variable is a mem- 
ber of what x assigns to the right-hand variable. Thus 


(1) x satisfies yez.=. xy € xz. 


Next, supposing we already know what it means to say that a func- 
tion x satisfies a given formula y and that x satisfies a given for- 


Opec. 298. 
5’ Ibid., 295. 
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mula z, we can easily explain what it means for x to satisfy their 
joint denial y j 2; viz., that x satisfies neither y nor z. So 


(2) x satisfies yj z.=:2 satisfies y. | . x satisfies z. 


It remains only to explain satisfaction of a quantification y qu z in 
terms of satisfaction of z; thereupon we shall have determined, 
recursively, the notion of satisfaction in relation to all formulas of 
L. Now it would appear that satisfaction of y qu 2 can be explained 
as follows: x satisfies y qu z if x not only satisfies z, but continues to 
satisfy zg even when what x assigns to the variable y is changed ad 
libitum; 1.e., 


(3) x satisfies y qu z .= 
(v)((w)(w # y.D. vw = zw) >.» satisfies z). 


In (1)—(8) I have followed Tarski. But Tarski’s construction was 
geared to the Russell—Whitehead logic, in which all entities are ele- 
ments, i.e., they all admit of membership in classes. For L, on the 
other hand, interpreted conformably to the axioms of Mathematical 
Logic, there are elements and non-elements. The possible relata of 
relations for L, and hence in particular the possible entities assigned 
by functions, are exclusively elements; but quantification in L 
ranges over all entities. Consequently (3) fails for L. For, the right- 
hand side of (3) says in effect ‘x satisfies z and continues to satisfy 
z when what x assigns to the variable y is changed to any other 
element’. We need ‘entity’ here rather than ‘element’ if we are to 
reproduce the intended force of ‘z satisfies y qu 2’. 

Basically the difficulty is as follows. What are wanted as satis- 
fiers of formulas are, in some sense, systems of values of variables. 
Functions, taken as assigning values to variables, are what we have 
been using as systems of values of variables. But functions can 
assign only elements, whereas both elements and non-elements are 
wanted as values of variables. Now the difficulty can be overcome 
by a small adjustment. Intuitively what is wanted of a system of 
values of variables is a specification of this sort: zo as value of ‘s’, 
2 as value of ‘t’, x2 as value of ‘w’, and so on. Now instead of repre- 
senting such a system of values formally as a function, viz., as the 
relation which relates 2 to ‘s’, x; to ‘t’, x2 to ‘uw’, and so on, let us 
represent it rather as the relation which relates each member of 2 
to ‘s’, each member of x; to ‘t’, each member of x2 to ‘wu’, and so on. 
There then ceases to be any requirement that xo, 21, X2, etc., be 
elements. They have to be classes, but this is no restriction, since 
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for L everything is a class; individuals are identified with their own 
unit classes.® 

Where x was a system of values in the old sense, viz., a function 
assigning the values to the variables, the value assigned by z to a 
variable y was x‘y. Where, on the other hand, x is a system of values 
in the new sense, viz., the relation of each member of the assigned 
value to the variable to which it is assigned, the value assigned by 
x to a variable y comes rather to be describable as the class of all 
things bearing z to y; briefly, x““y. Whereas x‘y had to be an ele- 
ment, x“‘.y can be anything. 

(1) and (3) can be adjusted to this revision by changing ‘z‘y’, 
‘cz’, ‘vw’, and ‘x‘w’ to ‘a“‘wy’, ‘az’, ‘vw’, and ‘x“‘w’. Or, equiva- 
lently and more elegantly: 


(4) x satisfies yez.= x(x““y, 2), 


(5) x satisfies y quz.= 
(v)((w)(yew.V. vw = zw) >. v satisfies z). 
(2) remains unchanged. 

In (4), (2), and (5) we have a recursive definition which deter- 
mines satisfaction correctly for all formulas of L. Moreover, all the 
notations used in (4), (2), and (5), other than the predicate ‘satis- 
fies’, are notations definable either in the logic Z or in the proto- 
syntax of L; all, in short, belong to L’. Satisfaction for L has been 
recursively defined in ZL’. Truth for L is then definable, as noted 
earlier, in one more easy step. 

But we are not yet involved in paradox. Paradox would ensue if 
satisfaction for L were directly definable in L’; i.e., if ‘a satisfies y’ 
were translatable into L’ not merely for each specific formula as 
value of ‘y’, but for variable ‘y’. Recursive definition does not auto- 
matically guarantee such translatability. However, there is a well- 
known technique due to Frege for turning recursive definitions into 
direct ones. Applied to (4), (2), and (5), the technique is this: ‘s 
satisfies ¢’ is explained as meaning that s bears to ¢ every relation wu 
which obeys the laws which are imposed on the satisfaction rela- 
tion in (4), (2), and (5). Thus ‘s satisfies ’? would be defined as: 


(6) (u){@)(y)@lu@, yez) = xa“y, z) : 
u(z, yjz) =. ula, y) | ula, z): 
u(x, y quz) = (v)((w)(yew.V.0%w = zw) D ua, z))] 
>) Ue, Oe 
6 Ibid., 122f., 135. 


7 Strictly, clauses should be inserted stipulating that y and z be variables in 
certain cases, formulas in others. They are suppressed for perspicuity. 
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But this maneuver sacrifices an important part of the force of 
(4), (2), and (5). For (4), (2), and (5) admit, in the roles of v and z, 
elements and non-elements on an equal footing; whereas (6), by 
representing v and z as relata of a relation u, renders irrelevant all 
values of ‘v’ and ‘x’ except elements. Because of this divergence, 
(6) as definiens of ‘s satisfies ¢’ fails to reproduce the sense of (4), 
(2), and (5). We may be as confident of this failure on the part of 
(6), anyway, as we are of the consistency of L. Satisfaction for L 
has been recursively defined in L’, but it is not directly definable in 
L’ unless L is inconsistent. 


<i 


«a On Frege’s Way Out 


In his review of Geach and Black’s Translations from the Philo- 
sophical Writings of Gottlob Frege, Scholz urges that I evaluate the 
idea which Geach and Black express thus in their Preface: 


Special attention should be paid to Frege’s discussion of Russell’s para- 
dox in the appendix to vol. ii of the Grundgesetze. It is discreditable that 
logical works should repeat the legend of Frege’s abandoning his researches 
in despair when faced with the paradox; in fact he indicates a line of solu- 
tion, which others (e.g. Quine) have followed out farther. 


Geach becomes more explicit in a footnote (p. 243): 


The way out of Russell’s Paradox here suggested by Frege has been 
followed by several later writers—e.g. by Quine in Mathematical Logic. 
Quine’s particular form of the solution would be stated as follows in 
Frege’s terminology: The concepts not a member of itself and member of 
some class but not of itself have the same class as their extension; but the 
class in question is not a member of any class, and thus falls only under 
the first of these concepts, not under the second. (But the relation of 
Quine’s solution to Frege’s is obscured because Quine does not explicitly 
make Frege’s distinction between a concept or property and its extension.) 


It is thus that I have been prompted to write the present paper. 
Church in his review has described Geach and Black’s idea as 

“far-fetched at best,” and a paper on the scale of the present one is 

not needed to bear him out. It has seemed worth while, however, to 


First published in Mind, 1955. An Italian translation appeared in the Rivista di 
Filosofia the same year. The paper appears here by permission of the editor of 
Mind. I have omitted an appendix because, as Church showed me, it was 
wrong. 
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relate Frege’s “line of solution” to the history of set theory and also 
to show how it leads to contradiction. 


1. Frege, Whitehead, and Russell on Attributes. Taking our cue 
from the concluding parenthesis of the note quoted from Geach, we 
may begin by reflecting briefly on ‘‘concepts or properties.” Or, 
since ‘concept’ hints of mind and ‘property’ hints of a distinction 
between essence and accident, let us say attributes. An open sen- 
tence, e.g., ‘x has fins’, is supposed to determine both an attribute, 
that of finnedness, and a class, that of fin bearers (past, present, 
and future). The class may be called the extension of the open sen- 
tence; also it may be called the extension of the attribute. 

Since Whitehead and Russell’s work is nearer home than Frege’s, 
and since occasion will arise anyway for certain comparisons, let us 
first view the notion of attribute in the context of Principia Mathe- 
matica. Whitehead and Russell, like Frege, admitted attributes in 
addition to classes. At the level of primitive notation, indeed, 
Whitehead and Russell adopted attributes to the exclusion of 
classes, and then introduced class names and class variables by 
contextual definition in terms of their theory of attributes; but at 
any rate their derived body of theory embraces the attributes and 
classes in effect side by side. The attributes were called propost- 
tional functions in Principia, and there was confusion between 
propositional functions in this sense and propositional functions in 
the sense of open sentences or predicates; but this confusion is now 
inessential, for we know how to rectify it without upsetting the sys- 
tem.” To proceed, then: in Principia we find variables ‘¢’, ‘Wy’, etc., 
for attributes, and ‘a’, ‘6’, etc., for classes. Also we find two ways 
of using circumflex accents in connection with open sentences, one 
for the abstraction of attributes and the other for the abstraction 
of classes; thus ‘é has fins’ names the attribute and ‘é(x has fins)’ 
the class. Formally the one difference between attributes and 
classes, for Principia, is that the law of extensionality: 


(1) (x)(cea.=.2€8)>D.a=8 


holds for classes whereas the corresponding law. 


1 For a treatment of the relevant notion of predicate see §1 of paper XVII 
below. Predicates in this sense are, like open sentences, notational forms. They 
are so devised as to expedite technical discussions of substitution. At those 
points in Principia where the “propositional functions’? ambiguously so-called 
can be construed as notational entities at all (rather than as attributes), they 
are identifiable more aptly with predicates than with open sentences. 

2 Cf. §5 of I, above. 
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(2) (t)(oz = yx) D.¢=y 


is not supposed to hold for attributes. The whole trick of defining 
class notation contextually in terms of attribute notation, in 
Principia, is indeed a trick of rendering the law of extensionality 
demonstrable for classes without adopting it for attributes. 

The use of distinctive notation for attributes and classes is a 
feature of Frege’s theory as well as of Principia. On the other hand 
there are differences. In Principia the realm of attributes can stand 
intact without benefit of classes—as indeed it explicitly does, from 
the point of view of primitive notation. For Frege, on the other 
hand, attributes depended in large part upon classes. Frege treated 
of attributes of classes without looking upon such discourse as 
somehow reducible to a more fundamental form treating of attri- 
butes of attributes. Thus, whereas he spoke of attributes of attri- 
butes as second-level attributes, he rated the attributes of classes as 
of first level; for he took all classes as rock-bottom objects on a par 
with individuals.’ 


2. In the Jaws of the Press. When most of the second volume 
(1903) of Frege’s Grundgesetze der Arithmetik was in type and ready 
for printing, Russell wrote Frege announcing Russell’s paradox and 
showing that it could be proved in Frege’s system. In response 
Frege wrote an appendix to that second volume. This appendix 
was a report of the crisis and an essay at surveying the situation 
after the body of the book was beyond recall. It was to his transla- 
tion of this appendix that Geach attached the footnote quoted at 
the beginning of the present paper. 

According to Frege’s original system, every attribute ¢ had a 
class £(¢x) as its extension, and 


(3) E(gx) = £(px) .D (x) (bx = yx). 
The line he takes in his appendix is that of revoking (3). In so doing 
he impoverishes, to some degree, his universe of classes. He now 
allows attributes to have as their extensions the same class even 
where the attributes differ to the extent of holding and failing of 
some object. 

Readers of Principia know that Russell’s paradox of classes has 
a direct analogue in terms of attributes, and that the authors of 
Principia therefore found need to frame their theory of types for 
attributes as well as for classes. Indeed, the basic form of the theory 


$1898, §§23, 24, 35. See also Carnap, 1937, 138. 
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of types in Principia is the theory of types of attributes, or propo- 
sitional functions. The type structure is automatically inherited by 
classes, through the contextual definitions by which the theory of 
classes is derived in Principia from that of attributes. Readers of 
Principia may wonder, therefore, that Frege can leave his own 
universe of attributes intact, and molest only the classes. Can we 
not reproduce Russell’s paradox in Frege’s theory of attributes, 
without using his classes at all? 

We cannot. The reason is that Frege had, even before the dis- 
covery of Russell’s paradox, the theory of levels of attributes hinted 
at above; an anticipation, to some degree, of the theory of types.‘ 
If in response to Russell’s paradox Frege had elected to regiment 
his classes in levels corresponding to those of his attributes, his 
overall solution would have borne considerable resemblance to that 
in Principia. 

Actually it is not to be wondered that Frege did not think of this 
course, or, thinking of it, adopt it. It was by having all his classes 
at ground level that he was able to avoid the use of high-level attri- 
butes, and this he liked to do. He regularly avoided ascent in his 
hierarchy of attributes by resorting at appropriate points to classes 
as zero-level proxies of attributes. Thus, though he gives examples 
of third-level attributes and alludes to a continuation of the hier- 
archy,® in practice he uses no variables for attributes above the 
first level, nor constants for attributes above the second. 

Russell’s Principles of Mathematics, like the second volume of 
Grundgesetze, appeared in 1903. Russell had written Principles in 
ignorance of Frege’s work; but before Principles was off the press he 
discovered Frege’s writings and read various, including the first 
volume of Grundgesetze. There was time for him to add a long and 
appreciative appendix to Principles; time also, we have seen, for 
him to write Frege and evoke, in turn, an appendix in the second 
volume of Grundgesetze; and time even for him to see that appendix 
of Frege’s and add still a note to his own appendix to Principles 
(p. 522), as follows: 


The second volume of Gg., which appeared too late to be noticed in the 
Appendix, contains an interesting discussion of the contradiction... , 
suggesting that the solution is to be found by denying that two proposi- 


4 A somewhat clearer anticipation of the theory of types is to be found still 
earlier, in Schréder, 1890, 245-249. Cf. Church, 1939. 

5 Frege, 1893, §35. 

6 Tbid., §24. 
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tional functions which determine equal classes must be equivalent. As it 
seems very likely that this is the true solution, the reader is strongly 
recommended to examine Frege on this point. 


8. Frege’s Details. Thus far I have described Frege’s suggestion 
only incompletely, viz., as the rejection of (3). Russell in the above 
note describes it likewise, using different terminology. To be told 
that (3) is no longer to hold, or is no longer to retain its full gener- 
ality, is to be told merely in what general quarter the revision is to 
be sought, and not what the revision is to be. Frege went on, how- 
ever, in that same appendix to Grundgesetze, to make his suggestion 
more definite. Commenting on a step-by-step recapitulation of the 
argument of Russell’s paradox, he writes: 


In both cases we see that the exceptional case is constituted by the 
extension itself, in that it falls under only one of two concepts whose exten- 
sion it is; and we see that the occurrence of this exception can in no way 
be avoided. Accordingly the following suggests itself as the criterion for 
equality in extension: The extension of one concept coincides with that 
of another when every object that falls under the first concept, except 
the extension of the first concept, falls under the second concept likewise, 
and when every object that falls under the second concept, except the 
extension of the second concept, falls under the first concept likewise.’ 


What Frege calls “concepts” (Begriffe) in this passage are of course 
what I have been calling “attributes.” Thus Frege is proposing, 
contrary to (3), that at least for some attributes ¢ and ¥ we take 
£(¢x) and £(x) to be one and the same class even though ‘¢v = yz’ 
be false of one special object x, viz., that class itself.® 

Classically, the members of ¢(¢x) are such that 


(4) (y)ly « (ox) .= dy]. 


This implies (3). Frege, in departing from (3), is therefore depart- 
ing from (4), thus allowing the membership of £(¢x) to deviate 
somewhat from its classical composition. But he keeps the deviation 
to a minimum, thus providing? that 


(5) (y)ly # £(ox) .D: ye &(gx) .= dy). 


7 Geach and Black, 242f., translating Frege, 1903, 262. 

8] am now abandoning Whitehead and Russell’s class variables ‘a’, ‘p’, 
etc. (see §1 above), in favor of the general variables ‘xz’, ‘y’, etc. This procedure 
accords with Frege’s, and is the appropriate one in the absence of a theory of 
types. 

91903, 264. 
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This provision still leaves open the possibility, for any particular 
¢, that £(¢z) either comprise exactly the objects x such that ¢z, or 
comprise all of them plus £(¢z) itself, or comprise all except @(¢z) 
itself. Frege implicitly elects this last alternative for all choices of 
¢;° thus 


(6) (y)ly €&(ox) .=. y A £(Gz) . dy]. 


Note that even this stipulation does not quite determine £(¢z); to 
view (6) as a definition of ‘é(¢x)’ would involve circularity, because 
of the recurrence of ‘é(¢z)’ on the right. Frege expresses awareness 
of this lack of specificity, in a remark immediately following the 
longer passage which I quoted a page back. 


Obviously this cannot be taken as defining the extension of a concept, 
but only as specifying the distinctive property of this second-level function. 


What we have just been examining would seem to be the ‘“‘way 
out of Russell’s paradox’’ which, according to Geach, ‘‘has been 
followed by several later writers—e.g. by Quine, in Mathematical 
Logic.”’ But he must have meant to refer only to broader lines of 
Frege’s proposal; for the idea expressed in (5) (let alone (6)) has 
never been used, so far as I know, by anyone but Frege. This is 
cause for satisfaction, because the idea leads to contradiction in 
any universe of more than one member. 


4. The New Contradiction. Sobocinski reports (220ff.) that 
Lesniewski proved the inconsistency of Frege’s proposal in 1938. 
Lesniewski’s argument, as set forth by Sobocinski, is hard to dis- 
sociate from special features of Lesniewski’s system. But Geach, 
who lately brought this paper to my attention, has succeeded in 
reproducing the argument in a Fregean setting, obtaining a con- 
tradiction from (6) and: 


(7) (Ax) (Ay) (x ¥ y). 


Before hearing of these developments I had derived a contradic- 
tion using, in place of (6), the weaker assumption (5). My deriva- 
tion is as follows. I define ‘V’, ‘A’, ‘2’, and ‘W’ as ‘¢(x = 2)’, 
Wa ~ 2)’, ‘é(a = 2)’, and ‘&(z)(vez.zex._.z =z)’, and make 
use of four cases of (5): 


(8) Zyy ~wz.Diyewz.=.y = 2), 
QO} Wy+~W .dDiyeW.= @)yez.zey.D.y = 2), 
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and the cases corresponding to ‘V’ and ‘A’, which reduce quickly to: 


(10) yy #V.D.yeV), 
(11) (y(ye A.D. y = A). 
Two corollaries of (8) are: 

(12) ONO De ay 


(13) QDAlEGywey.yes.yrMz) ).sez.2z ie 
If we change ‘iy’ in (12) to ‘A’, then by (11) we can get ‘y = A’; so 


(14) (yy = A.D.y = A). 
If we change ‘vz’ to ‘z’ in (8) and reduce, we get ‘yez.).y = 2’; SO 
(1S) @OiQe —iz.vez.).y ==). 


If we change ‘V’ to ‘A’ in (10), then by (11) we can get ‘(y)(y = A)’, 
contrary to (7);so V # A. Hence, by (10) and (14), 


(16) AeV, LV ¥ A, uV ¥ A. 


By (8), A#uV.d:AeuV.=. A = cV. Hence, by (16), ~(A« 
uwV), whereas Ae V. So 


(17) uV # V. 

By. (15), vame= wey. preay. Day = 1. Henee; by (12), 

(18) (yyy = wy .D.y = vy = uy). 

From (17), by (18), we have “V # wV’, and thence, by (18) again, 
(19) uwV ~uV. 


If eV + W and ~(uV e« W), then, by (9), 
(Jz)(uVez.zeuV .uV ¥ 2), 


whence, by (13), wV ecV and uV # -V; but then, by (8), w.V = V, 
contrary to (17). So 


(20) eV = W.V.aVeW. 


Suppose .W = W. Then, by (15), «VY « W.)D.uV = W, and hence, 
by (20), uV = W. But then, by (19), .W # W. So 


(21) iW #W. 
Hence, by (12), 
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(22) W eiW. 
By (21) and (9), 
(23) We W.= (2)@Wez.zecW._).cW = 2). 


Hence .We W.D:t.We W. WeiW._). iW = W. So, by (21) and 
(22), 


(24) ~(W ¢ W). 


Then, by (23), (dZz)@Wez.zecW.cW #2), whence, by (13), 
iW e W, in contradiction to (24). 


5. A Kernel of Truth. It is not to Frege’s discredit that the ex- 
plicitly speculative appendix now under discussion, written against 
time in a crisis, should turn out to possess less scientific value than 
biographical interest. Over the past half century the piece has per- 
haps had dozens of sympathetic readers who, after a certain 
amount of tinkering, have dismissed it as the wrong guess of a man 
in a hurry. One such reader was probably Frege himself, sometime 
in the ensuing twenty-two years of his life. Another, presumably, 
was Russell. We must remember that Russell’s initial favorable 
reaction, quoted at the end of §2, was a hurried conjecture indeed; 
five years later we have, in significant contrast, his theory of types. 

In any event, what Russell actually described in that quoted 
note was not Frege’s full suggestion, but only its broadest feature: 
restriction, somehow, of (8). This feature, though Russell later 
turned his back on it, is a good one; and by working out from it we 
can find what Frege’s ill-starred appendix did contain pertinent to 
the subsequent course of logic. That broad feature, and more, can 
be said with some plausibility to have “‘been followed by several 
later writers—e.g. by Quine, in Mathematical Logic.” 

In order to implement the ensuing comparisons we must elimi- 
nate Frege’s reference to a realm of attributes. This necessity is re- 
marked, though in an inverted form, in the concluding parenthesis 
of Geach’s well-thumbed footnote. But the required elimination is 
less difficult than that parenthesis might lead one to expect, thanks 
to the circumstance that none of Frege’s remarks in this connection 
depend on any principle of individuation of attributes broader than 
the principle of individuation (viz., sheer sameness of spelling) of 
the corresponding open sentences themselves. Where Frege speaks 
of classes as extensions of attributes, we can speak metalogically of 
classes as determined by open sentences. 
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This shift requires no rewriting of (3)—(6) and related formulas. 
These merely change in status from open sentences, with attribute 
variables ‘¢’ and ‘fy’, to schemata with dummy sentences ‘¢z’, 
‘yx’, and ‘gy’. The difference, conspicuously immaterial here, 
would become material in cases where ‘¢’ occurs in a quantifier; 
such cases would cease to be allowable.’° But we shall not encounter 
any of them. 

To reject (3), as Frege did, was to allow (8) to be false for some 
attributes ¢ and y. Reconstrued now without reference to attri- 
butes, his rejection of (3) comes to consist in allowing (3) to be 
false under some substitutions of open sentences for ‘¢z’ and ‘yz’. 

Now it is time to point out an important difficulty in connection 
with rejecting (3). The difficulty lurks equally in Frege’s account 
and in Russell’s note. It is this: the prefix ‘¢’ has been understood, 
to begin with, as ‘the class of all and only the objects x such that’. 
If we presume to declare some case of (3) false, thus taking 
‘E(ox) = £(x)’ as true and ‘(x)(¢x = yz)’ as false (for some par- 
ticular choice of open sentences in place of ‘¢z’ and ‘yx’), then 
surely we have departed from that original reading of ‘é’, and left 
no clue as to what the classes (gx) and ¢(Wx) are supposed to be 
which are talked of in the allegedly true equation ‘é(¢x) = £(Wx)’. 
Either we must give some supplementary reading of the notation 
‘¢’, or we cannot read the presumed counter-instances of (38). 
Similar difficulties arise over the word ‘extension’ in Frege’s ac- 
count, and over the word ‘determine’ in Russell’s note. 

Earlier we observed that rejection of (3) involves rejection of 
(4). Let us accordingly concentrate on (4) instead of (3); for, it 
happens that the above difficulty as it affects (4) can be coped with. 
The reasoning is as follows. In case the open sentence represented 
as ‘fx’ happens to be such that some class z has as members all and 
only the objects x such that ¢z, certainly £(¢2) is still to be under- 
stood as z; it is only in the unfavorable cases that ‘é’ demands novel 
interpretation. The question whether to adhere to (4) for all open 
sentences, or to reject it for some, is thus simply the question 
whether to adhere to the abstraction principle: 


(Jz) (x) (a ez .= $2) 
for all open sentences or to reject it for some. Instead of talking of 
rejection of (4), therefore, we can talk of rejection of the abstrac- 


10 See §5 of I, above; also Methods of Logic, 22, 82, 91f., 129f., 203-208; also 
From a Logical Point of View, 8-11, 102-116. 
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tion principle. The useful difference between this principle and (4) 
is that whereas we cannot deny a case of (4) without facing the 
question what £(¢z) is thenceforward supposed to be, we can deny 
any case of the abstraction principle without having to recognize 
the prefix ‘é’ at all. 

All the well-known ways around the paradoxes, except for the 
theory of types, involve recognizing exceptions to the abstraction 
principle. This is not to be wondered at, since the abstraction prin- 
ciple yields contradiction by elementary logic as soon as ‘gz’ is 
taken as the open sentence ‘~(2 ex)’. Either you must declare 
some open sentences to be immune to the abstraction principle or, 
as is done in the theory of types, you must declare certain harmful 
forms such as ‘~(z ex)’ to be ungrammatical and not sentences at 
all. 

The theories which allow exceptions to the abstraction principle 
differ from one another, essentially, in where they fix the break 
between the cases which hold and the cases which fail. Zermelo was 
perhaps the first to perceive clearly this focal role of the abstraction 
principle; at any rate, his set theory (1908) was the first that pro- 
ceeded explicitly by postulation of instances and families of in- 
stances of that principle. 


6. Extension Extended. In turning our attention from (3) and (4) 
to the abstraction principle as formulated above, we came out 
where we need no longer worry about novel interpretations of the 
prefix ‘¢’, and can simply acquiesce in the inappropriateness of that 
prefix in connection with many open sentences. Some open sen- 
tences (those fulfilling the abstraction principle) happen to have 
extensions; others not. Frege himself touched upon that possibility, 
in the following words: 


. we must take into account the possibility that there are concepts 
with no extension (at any rate, none in the ordinary sense of the word)." 


However, he promptly and explicitly discarded that alternative, 
electing rather to require an extension in every case. He thus 
presented himself squarely with the problem of reconstruing 
‘£(¢x)’ compatibly with rejection of (3). Actually he cannot, de- 
spite the above passage, have hoped to retain extensions generally 
“in the ordinary sense of the word,” since this would mean retaining 
(4) and so (3). But he wanted there to be, for every case ‘dz’, an 
extension £(¢x) under some mildly variant reinterpretation of 
11 Geach and Black, 239, translating Frege, 1903, 257. 
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‘extension’ and ‘£(¢x)’. He essayed such a reinterpretation in his 
final suggestions, as noted above in connection with (6), but rec- 
ognized that he had not fixed it uniquely. 

Since Frege’s full suggestion leads to contradiction, there is no 
point in trying to devise a unique reinterpretation of ‘é(¢z)’ suited 
to exactly his principles. On the other hand, the general question of 
devising reinterpretations of ‘¢(¢x)’ for systems which allow excep- 
tions to the abstraction principle is worth considering. Narrowing 
the field a little, we might require of a satisfactory reinterpretation 
that #(¢x) continue to contain only objects x such that ¢z, even if 
it cannot at the same time exhaust them. Now given this condition, 
a further imperative becomes evident: £(¢x) must not be a subclass 
of any further class which contains only objects x such that ¢z. For, 
if £(@x) is not to exhaust the objects x such that ¢z, at least we 
want it to come as near as possible to doing so. 

How to reconstrue ‘é(¢x)’, conformably with the above require- 
ments, will depend on what cases of the abstraction principle hold; 
and this is the central point on which set theories differ from one 
another. The particular distribution of cases for which the principle 
holds, for a given set theory, can be such as to make any general 
interpretation of ‘é(¢x)’ conformable to the foregoing paragraph 
impossible. This can happen in either of two ways. It may happen, 
for some open sentences, that every class whose members all fulfill 
the sentence is a subclass of a further class whose members all ful- 
fill the sentence, so that there is no final class of the kind. Again it 
may happen, for some open sentences, that each of several classes 
qualifies under the requirements of the preceding paragraph, and 
there is no systematic way of choosing one from among them. 

Zermelo’s system (1908), von Neumann’s (1925), and that of my 
‘“New foundations’ (1937) are three set theories which differ dras- 
tically from one another in point of what cases of the abstraction 
principle are assumed to hold. Each of the three, however, is such 
as to render impossible any general interpretation of ‘¢(¢x)’ within 
the above requirements. In any such system the sensible course is 
simply not to call for an extension £(¢z), in any sense, for each open 
sentence ‘z’.!? 

A striking feature of von Neumann’s theory is that it, unlike its 
predecessors, provides for there being some classes which are not 

12 But one can still apply ‘z’, in its full classical import, to the particular 


open sentences for which the abstraction principle holds, defining this usage 
via singular description. Such is my procedure in ‘‘New foundations.” 
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members of any classes at all. Such classes I called non-elements 
when I gratefully carried his idea over into my Mathematical Logic. 
The set theory of Mathematical Logic, though depending heavily on 
the presence of non-elements, differs from von Neumann’s system 
in consequential ways. The cases of the abstraction principle which 
hold for my system differ from those which hold for von Neu- 
mann’s, and can be specified in an unusually simple way, as follows: 
the principle holds except when there are non-elements fulfilling 
‘pz’.13 Now for this system, unlike von Neumann’s, Zermelo’s, and 
that of “New foundations,”’ a general interpretation of ‘%(¢x)’ zs 
available which conforms to the requirements lately set forth. It 
turns out to be this: £(¢z) is the class of all elements x such that ¢z. 
Formally stated, £(¢z) is the y such that (x)[z « y .=.(4z)(x € z) . da]. 

The above description of the system of Mathematical Logic does 
not determine it in full, but applies still to a whole genus of sys- 
tems.'* The cases of the abstraction principle which hold for my 
system may seem just now to have been fully specified, leaving no 
latitude for further variation; but this is not so, for the above speci- 
fication appeals to the distinction between element and non-ele- 
ment, and we are still free to draw this line very much as we choose. 


7. Frege and the Later Trends. Let us consider, in summary sur- 
vey, how the suggestions in Frege’s appendix relate to set theory 
from Zermelo onward. Frege’s rejection of (3) implies, we observed, 
rejecting the universality of the abstraction principle; and this 
latter departure is a focal point of modern foundations of set theory, 
from Zermelo onward, aside from the theory of types. But the claim 
for Frege is weak here. Frege’s eyes were on the extension £(¢z), 
and how to preserve it or a reasonable facsimile in the general case. 
Zermelo had no such preoccupation, and faced the problem of set- 
theoretic foundations squarely in terms of the cases of the abstrac- 
tion principle. Significantly, neither Zermelo’s set theory nor von 
Neumann’s nor that of my “New foundations” will accommodate 
the remotest analogue of an extension operator applicable to open 
sentences generally. Zermelo, rather than Frege, stands out as the 


18 This can be seen by reflecting that in *202 of Mathematical Logic the clause 
of elementhood is dispensable if and only if it is implied by the formula which 
stands in conjunction with it. 

“ B.g. my “Element and number,’’ above; Wang, ‘‘A new theory of element 
and number.” The earliest instance is perhaps the system of the first edition of 
Mathematical Logic, which, however, proved inconsistent. The emendation of 
the system, used in the revised edition, is due to Wang (1950). 
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pioneer in the genre of set theories represented by these three. Not 
but that von Neumann in turn added a drastic innovation, in his 
admission of non-elementhood. Moreover ‘‘New foundations,”’ 
though like Zermelo’s in that it proceeds by selective adoption of 
cases of the abstraction principle, is in equal measure indebted to 
Russell’s theory of types. The principle determining what cases of 
the abstraction principle are to hold for ‘““New foundations” is an 
adaptation of a formal feature of type theory. 

If Frege’s preoccupation with a generally applicable analogue of 
the classical extension operator tends to estrange him from the 
above company, it should count correspondingly in favor of an 
affinity between his suggestions and my Mathematical Logic. For in 
the latter system there is, we have seen, a generally applicable 
adaptation of the extension operator. Moreover, to turn to a 
prominent individual case, the extension £ ~(xe2) fails to be a 
member of itself, in Mathematical Logic as in Frege’s plan; this 
conformity was remarked in Geach’s footnote. 

Yet the differences are great. Frege’s extension operator resisted 
actual interpretation, but was supposed, in each application, to 
approximate to the classical extension to within one possible 
discrepancy of membership. Mine is defined, and admits of infinite 
discrepancies. Furthermore von Neumann’s non-elementhood 
concept, utterly un-Fregean, is of the essence of Mathematical 
Logic. Incidentally there is the inconsistency of Frege’s specific 
proposal; this is indeed a point of similarity with the first edition 
of Mathematical Logic, but quite possibly a point of contrast with 
the current edition, thanks to Wang’s brilliant repair. 

A passing remark, finally, on genesis. We discerned, in the de- 
mand for a generally applicable adaptation of the extension 
operator, a particular resemblance which Frege’s proposal bears 
to the system of Mathematical Logic and not to previous set theories 
such as Zermelo’s, von Neumann’s, and that of ‘‘New founda- 
tions.”” Now this particular resemblance is due not to heredity 
but to convergence: response to common demands of algorithmic 
facility. The ancestry of Mathematical Logic is an open book; the 
parents are von Neumann’s system and “New foundations,” both 
of whose genealogies were looked into a page back. 

All of modern logic owes an incalculable debt to Frege. If any- 
one can be singled out as the founder of mathematical logic, it is 
by all odds he. But I have been concerned in these pages, like 
Geach and Black in the remarks which I quoted from them, only 
with the hurried appendix to the second volume of Grundgeseize. 


AXILT 


«t Completeness of 
the Propositional 
Calculus 


The completeness of the propositional calculus was first proved 
by Post, 1921. His somewhat condensed proof has been succeeded 
by more detailed presentations of substantially the same argu- 
ment,! and also by several proofs of radically different forms.? 
The present paper contains still another proof, offered because 
of its relative simplicity. In part this proof follows a plan which 
was sketched by Wajsberg? for another purpose, viz., for proving 
the completeness of the sub-calculus involving only the material 
conditional. 

For the present proof a systematization of the propositional 
calculus will be used which is due to Tarski, Bernays, and Wajs- 
berg.* It involves the material conditional ‘>’ and the falsehood 
‘E’ as primitive; thus the formulae are recursively describable as 
comprising ‘F’, the variables ‘p’, ‘q’, ‘r’, . . . , and all results of 
putting formulae for ‘p’ and ‘q’ in ‘(p D q)’. The denial ‘~p’ is 
definable as ‘(p ) F)’, and all other truth functions are then de- 
finable in familiar fashion. (Conversely, in a system admitting ‘~’ 
Reprinted from the Journal of Symbolic Logic, Volume 3, 1938. 

— and Ackermann, 1928, 9-29, 33; Hilbert and Bernays, Vol. 1, 
' + Lukasiewicr, “Kin Vollstandigkeitsbeweis’; Kalm4r, 1935; Hermes and 
Cholz. 


3154-157. 
4See Wajsberg, 132, 157-159. 
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instead of ‘F’ as primitive, ‘F’ might be explained as an abbrevia- 
tion of ‘~(p > p~p)’.) The postulates are four: 


(1) (p> 4) > (@D7) De> 
(2) (@ >.¢@Y D®) D®) 

(3) @D @ D7) 

(4) ep). 


Theorems are derived by substitution and modus ponens as usual; 
among them are the following: 


(5) (s > s) 
(6) (oma Dr) D (ED @ D> 2) 
(7) (GD @D7) D(eD) DC JIGS, 
(8) (s DI@ D @) D (¢ Dr) D6 DID) 
(9) (emis) BG 1) S(t) u) D @ >a) 
Go) “¢ >G 3H) D (@ D7) D> (@D 5s) D Gla 
My) @@ODC6D))D@ Dn) D(eDs) DeLee 
wey ie D> @) (Ss D7) DEDCD eT) 
qs) "@) G's) 
q@4) «@ > @G@Ds))) 
p(n). o) 7) DG) p)) 
(Gm -) g) Dr). D. (oe 1) Dt) 
G7) Km (s Ds) D (@ Dw) DT) 2D Ga 7) 
is @)s) D (¢ DD De) DG wD 
Ge) §6(@)D CDs) D 
(qD> Duy) DeD(GbDI)DICAAD 

Zo wip DG G)s))> 

(p>) @DtDw))dI@®@DdEedGDdID¢CoeF 
G21) @D(p~Dd—D7) DD © D®) DGD 2a 
22). ((pDq) 5D (sD 7) D ( €D*) DS @ Cae 
2 (¢D(F DD) DT) Ss Ge) 
(244) (FDR 
(25) (FDO EFDF) 
@o) (@DF) >) fD») 
ee (dF DF) DF) DF). 


The derivation of (5)-(12) from (1)-(3) is recorded elsewhere.°® 
(13)-(27) are proved as follows:® 


5In my System of Logistic, 68, 72-74: derivation of 2-5, 2-7, 2-86, 2-87, 
2-94, 2:96, 2:97, and 2-99 from 2-1, 2-2, and 2-3. 
6 Concerning the notation see ibid., pp. 61-71. 
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()—(5) D (13). (3)—(13) D (14). 

y= (Dip/(p 2 @); ¢/7; 7/p) Dal) © @). 

@)—(15) >. (15) (p/7; g/p; r/p) DU). 

OP) —(7)\s/ (18); ¢/(¢ D p)) D: @) D. G3) DAZ). 

m0) ». Oi@/(s > 4); t/C D &)3@/% D @) OD (8). 
(10)—(18) > (19). (10)—(19) > (20). (11)—(16) > (21). 
(11)—(21) >: (6)(p/(p D 4); t/s) D. (6) D (22). 

(6)—(6) (p/q; ¢/(4)) D. (4) D (23). 

(5)—(24). (3)—(25). (5)—(26). (2)—(27). 


A formula x will be called regular if, for all ¢, ¥, and w such that 
w results from putting Y for 20 occurrences of ¢ in x, 


EG D® D@ ae) 2D)" (a) 
and Ee Da D(@ Be) De Dig)’ (b) 


It will be proved that every formula is regular. 
(1) Every formula with no occurrences of ‘.’ ts regular. 


Proof. Let x have no occurrences of ‘’, and let w result from 
putting y for 20 occurrences of gin x. Then x is a single letter (‘F’ 
or a variable) ; hence either w is x, in which case (a) and (b) follow 
from (14), or else w is y and x is ¢, in which case (a) follows from 
(3) and (b) from (13). 


(II) Lf every formula with Sm occurrences of ‘ ’ ts regular, so ts 
every formula with m + 1. 


Proof. Let x have m + 1 occurrences of ‘—’, and let w result from 
putting y for 20 occurrences of ¢ in x. In the trivial case where ¢ 
is x and w is y, (a) follows directly from (8) and (b) from (13). In 
all other cases x is "(x1 > x2)! and w is "(w: > we)', where w, and 
we result from putting y for 20 occurrences of ¢ in xi and x2. But 
Xi and x2 have Sm occurrences of ‘>’. Hence, by hp, 


Pie DY) > (YD) D Ga Die)", (i) 
E'((¢D ¥) D (HD 4) D @D u)))' (il) 
E'((@ DY) D (HD 9) D Ga D w&)))), (ili) 
and E'((¢ D ¥) D (WD $) D (@: D x2)))" (iv) 


By (20), + ‘[@i)] D. [Gii)] D [(a)]7 and F'[@] D. [Gv)] D [(b)P- 
(III) Every formula is regular. Proof: (1), (11). 
Three corollaries follow: 


7 Concerning the metamathematical notation see VII above. 
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(IV) If wis formed from x by putting ‘(F > F)’ for 20 occurrences 
of ¢, then H'(¢ D (x D @)) and F(¢ D ( D x)". 


Proof. By (1II), 
E@ DD FPF) SD (PDF) OW) D&D oP ae 
amd = "'((¢ D@ OF) OUP OF) D4) ely) ae 


By (17), F'[@)] D (6 D (x D @)) "and F[Gi)] D @ D (@ D x)" 


(V) If y ts formed from x by putting ‘¥’ for 20 occurrences of 4, 
then |-'((@ DF) D&D) and F(¢ DY) DW Dy)” 


Prog. By (UL), Fe DE) DUP De) Doo (i) 
and ENS DF) (GO) DY Dx 


By (23), FI@]D (@DHND&DY)' and -1H@)) DED 
Ta) OL 


(VI) If ~ and w are formed as in (V) and (IV), then E'(WD 
(wD x))% 


Broo, By (VY), I-'(@ DF) Do wy) - (i) 
By (1V), a) DOW 2a) (ii) 


By (22), F'[@] D. [Gi)] D &@ D @D x)" 


Preparatory to showing that every tautology is a theorem, the 
notion of tautology must be sharply formulated. Roughly, ¢ is a 
tautology if every result of substituting ‘I’ for 20 variables of ¢, 
and ‘T”’ for the rest, reduces to “T”’ by the truth-table process—i.e., 
by a continued process of putting ‘F’ for ‘(T > F)’ and ‘T” for 
(F > FP)’, ‘(FD T)’, and ‘(T > T)’. Hence, if by way of adhering 
to the notation of the system we use ‘(F > F)’ instead of the letter 
“T’, we can formulate the notion of tautology as follows. ¢ is said to 
reduce to y if there are ¢o, di, . . . , on(r» 2 O) such that ¢p is 4, 
gn is, and, for each 7 from 1 ton, ¢; is formed from ¢;_1 by putting 
‘Y’ for ‘(FD F) DF)’ (Case 1) or by putting ‘(F > F)’ for 
‘(Ff > (FD F))’ (Case 2) or for ‘((F D> F) D (FD F))’ (Case 3). 
An evaluant of ¢ is any result of substituting ‘F’ for 20 variables 
and ‘(F > F)’ for all other variables in ¢. ¢ is called a tautology if 
all its evaluants reduce to ‘(F } F)’. 


(VII) If ¢ reduces to ¥, K'(6 D Wy) and F'(W D ¢)". 

Proof. In Case 1 (see above), by (V), F'(27) D (¢:-1 D ¢;) 'and 
'(27) D (¢: D ¢:-1)"; in Case 2, by (IV), F (25) D (1 Day 
and + '(25) D (¢; D ¢:-1)1; and in Case 3, by (IV), -'(26) D 
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(¢:-1 > ¢:)' and -'(26)D (¢:D eu)". Thus -'(¢>D ¢i)', 
LM (dy _) do) |, mare ary tN (ba-1 ») yyy also L(y » ee) LM (dn-1 
ae), ..., —'@i De) Mtenee -'@Dy)' and -"e> 
¢)', by use of (1). 

(VIII) If ¢ ts a tautology without variables, / ¢. 


Proof. Being its own evaluant, and a tautology, ¢ reduces to 
‘(F > F)’. Hence, by (VII), + '(24) D ¢". 


(IX) If tr for all tautologies + with n variables, so also for those 
wenn + 1. 


Proof. Let x be a tautology with n + 1 variables, one of which is 
@. Let w and w result respectively from substituting ‘F’ and 
‘(F > F)’ for ¢ in x. Then y and w are tautologies: for all their 
evaluants are evaluants of the tautology x and hence reduce to 
‘(F > F)’. Also, y and w have just 7 variables. Hence, by hp, Fy 
and Fw. But, by (VI), E'(W D ( D x))". 


(X) If $ ts a tautology, + ¢. Proof: (VIID), (IX). 


The completeness of the system in one sense is thus established. 
It remains to show that the system is complete also in Post’s sense; 
viz., that from any indemonstrable formula ¢ as premiss every 
formula x is derivable. By (X), ¢ is not a tautology; hence ¢ has an 
evaluant y which does not reduce to ‘(F > F)’. But, being a for- 
mula without variables, Y must reduce to ‘(F > F)’ or ‘F’; hence to 
‘F’. Therefore, by (VII), E'(¥ D F)". Thus ‘F’ is derivable from 
y, which is in turn derivable from ¢ by substitution. But from ‘F’ 
by (4) we can derive ‘p’, and thence, by substitution, x. 


XLV 


at On Cores and Prime 
Implicants of Truth 


Functions 


What is called a truth-functional formula in (alternational) normal 


: ; pe ee) . : 
} ? be) 
form is built up of sentence letters ‘p’, ‘q’, etc., or their negations 


‘pn’, ‘g’, etc., or both, by using only the notations of conjunction 
and alternation, and in such a way as to subject alternations never 
to conjunction but only vice versa: thus ‘pg V prs V 7’. This paper 
is concerned with the problem of reducing an arbitrary truth-func- 
tional formula to a shortest equivalent in normal form. Part of the 
content of my last previous paper on the subject! will be presented 
anew in an improved way, and a further theorem will be estab- 
lished. I shall not assume familiarity with my previous papers. 

Let us sharpen our terminology. Sentence letters and their nega- 
tions are called literals. Literals and conjunctions of them are called 
fundamental formulas, provided that none contains the same letter 
twice. Fundamental formulas and alternations of them are called 
normal, and are said to have those fundamental formulas as their 
clauses. On these definitions, a formula is convertible to normal 
form only if it is not self-contradictory; but there is no serious loss 
in setting aside the self-contradictory cases. 

A prime implicant of a formula ® is a fundamental formula that 


Reprinted from the American Mathematical Monthly, Volume 66, 1959. 
1A way to simplify truth functions.’’ Cited hereafter as WSTF. 
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logically implies but ceases to when deprived of any one literal. 
A normal formula will be called unzliterally redundant if it is equiva- 
lent to what remains of itself on dropping some one occurrence of a 
literal. Obviously then a normal formula is uniliterally redundant 
if and only if not all its clauses are prime implicants of it. Conse- 
quently, in particular, 


(1) Any shortest normal equivalent of a formula ® will be an alterna- 
tion of prime implicants of ®. 


Ignoring tautologies along with self-contradictions, as will be 
convenient hereafter, one finds that 


(II) The prime implicants of a formula exist and are finite in num- 
ber. 


To see that they exist, consider a formula @ and any assignment 
of truth values to the letters of ® that verifies —say truth to ‘p’, 
‘q’, and ‘s’ and falsity to ‘r’. The assignment determines a funda- 
mental formula—‘pg7s’, in this example—that implies #; and it, 
or part of it, is a prime implicant of ®. To see further that the prime 
implicants of ® are finite in number, we just reflect that they are 
bound to contain no letters foreign to ©, since any such letter could 
be dropped without affecting the implication. 


(III) A formula ts equivalent to the alternation of all tts prime 
amplicants. 


For a formula @ is implied by each of its prime implicants and 
hence by their alternation; and conversely every truth assignment 
that verifies verifies some ®-implying fundamental formula (cf. 
‘pgqrs’ above) and therewith some prime implicant (viz., that funda- 
mental formula or part of it) and therewith the alternation of the 
prime implicants. 

In view of (I) and (III), a formula can be transformed into a 
shortest normal equivalent in two stages: (A) transform it into the 
alternation of all its prime implicants and then (B) delete from that 
alternation the largest possible combination of jointly superfluous 
clauses. (A) can be accomplished by a technique due to Samson 
and Mills, the explanation of which calls for two more definitions. 
If two fundamental formulas ¢ and yw are opposed in exactly one 
letter (so that ¢ contains ‘p’ affirmatively, say, and W contains ‘p’), 
then ¢ and w will be said to have as their consensus the formula 
which we get from the conjunction ¢y by deleting the two opposed 
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literals and any repetitions.? If @ and W are fundamental formulas 
opposed in no letter, and all letters (hence all literals) of y are in 4, 
then ¢ will be said to (notationally) subsume wy.’ 

Preparatory to the business of (A), we may suppose ® put into 
normal form by familiar logical procedures. Now the discovery of 
Samson and Mills is that , thus prepared, goes over into the alter- 
nation of its prime implicants if we persevere in these two equiva- 
lence transformations: 

(i) If a clause subsumes another, drop the former. 

(ii) Adjoin, as an additional clause, the consensus of two clauses 
(unless it subsumes a clause already present). 

That (i)—-(ii) are bound eventually to convert © into the alterna- 
tion of just its prime implicants, and all of them (ignoring differ- 
ences of order in a conjunction), is proved by proving the following 
four theorems. 


(IV) A normal formula remains susceptible to (11) as long as some 
prime implicant of 1t is not a clause of it (to within a permutation of 
a conjunction). 


(V) A normal formula remains susceptible to (i) or (ii) as long as 
some clause of tt 1s not a prime implicant of 2t. 


(VI) Normal formulas go into normal formulas under (i) and (ii). 
(VII)! No normal formula ts susceptible to (1)—(ai) without end. 


Proof of (1V). Let & be a normal formula and x a prime implicant 
of it that differs Gn more than order) from all clauses. Then also, 
being prime, x subsumes no clause. So there is at least. one funda- 
mental formula, x anyway, that has the three properties of (a) 
subsuming x, (b) subsuming no clause of , and (c) exhibiting no 
letters foreign to & (cf. end of proof of (II)). Moreover, there is an 
obvious limit to how long a fundamental formula having property 


2 This definition of consensus is more liberal than that in WSTF, in that it 
counts ¢ as consensus of a¢ and & (and of &¢@ and a). (i) below is consequently 
simpler than its counterpart in WSTF, and correspondingly for the proof of 
(IV) below. This liberalization of the definition is due essentially to Bing, 
but note that I avoid his “void formula.” 

3 My terminology has proved confusing. When ¢ subsumes y in my intended 
notational sense of having among its literals all those of y, then, precisely, y 
subsumes ¢ in a certain logical sense: y has among its verifying truth-value 
assignments all those of ¢. 

4'This theorem and its proof, which I neglected to include in WSTF, are 
given here on the appreciated advice of the referee. Note that the parenthetical 
proviso in (ii), which may look dispensable in view of (i), is needed to assure 
(VII)—as is indeed remarked in effect in WSTF at the top of p. 628. 
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(c) can be. So there is at least one fundamental formula, call it y, 
that has the three properties (a)—(c) and is exceeded in length by 
no fundamental formula having those properties. Yet y does not 
exhibit all letters of ©; for, if it did, then, having also property (b) 
as it does, Y would oppose every clause of ® in one or another letter, 
and so not imply ®; whereas actually y must imply ®, having prop- 
erty (a). So y lacks some letter of &, say ‘p’. Since y is a longest 
fundamental formula with the properties (a)—(c), py must lack one 
of those properties and so must jy; yet obviously not (a) or (c); so 
(b). So there are clauses ¢; and ¢2 of © such that py subsumes ¢; 
and jy subsumes ¢e. But y, having the property (b), subsumes 
neither; so ‘p’ and ‘pj’ must occur respectively in ¢; and @». Still di 
and ¢2 are not just ‘p’ and ‘p’, or & would be tautologous; nor are 
¢1 and ¢2 opposed in letters other than ‘p’, since their further literals 
are common to y. So ¢; and ¢2 have a consensus, say ¢. Moreover ¢ 
subsumes no clause of ®; for y subsumes ¢, and y has the property 
(b). So ¢ can be added by (ii). 


Proof of (V). Let ® be a normal formula and ¢ any clause of it 
that is not a prime implicant. Then ¢, implying ®, subsumes some 
prime implicant y. If y is a clause of © (to within a permutation of a 
conjunction), we can apply (i) to drop ¢; and otherwise we can 
apply (ii) in view of (IV). 


Proof of (VI). Obvious. 


Proof of (V11). Elimination of a clause ¢ by (i) depends on there 
surviving some clause ¢’ that ¢ subsumes; elimination of ¢’ in turn 
depends on there still surviving some clause ¢”’ that ¢’ (and hence 
¢) subsumes; and so on. Therefore, in view of the parenthetical 
part of (11), no clause ¢ once dropped by (i) can ever be restored by 
(ii). But neither, in view of the parenthetical part of (ii), can a 
clause already present be reintroduced in duplicate by (ii). To sum 
up, (ji) can never introduce the same clause twice. But there are 
only finitely many different fundamental formulas for (ii) to intro- 
duce, since no letters foreign to are drawn on. So the use of (ii) 
must terminate. Also the use of (i), being subtractive, obviously 
must terminate. 

So much for task (A), the conversion of a formula into the alter- 
nation of all its prime implicants. Toward the continuation task 
(B), that of determining what largest combination of the resulting 
clauses can be dropped as jointly superfluous, systematic methods 
have been developed by Ghazala. 
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But a shortcoming of the whole (A)-(B) approach is that it de- 
pends on exhausting the prime implicants; for, as Rolf K. Miller 
remarked to me in 1955, their number can even exceed the total 
number of possible truth-value assignments when there are many 
letters. According to Fridshal, this can happen after five letters. He 
cites a formula in nine letters which, by his computations, has 1698 
prime implicants—whereas the total number of possible truth- 
value assignments to nine letters is only 512. So a general technique 
is needed for finding an adequate minimum alternation of prime 
implicants without handling all prime implicants on the way. 

A normal formula may be called clausally redundant if it is 
equivalent to what remains of itself on dropping one of its clauses 
of alternation; and zrredundant, simply, if it is neither clausally nor 
uniliterally redundant. Obviously any shortest normal equivalent 
of a formula will be irredundant. Happily we can render a normal 
formula irredundant without regard to the totality of its prime 
implicants, simply ridding the formula of superfluous clauses and 
superfluous literals one by one. To see whether a clause y is super- 
fluous in ® V y, we just check whether ¥ implies ®. Such implica- 
tion cannot in general be recognized by mere notational subsump- 
tion, as can implication between fundamental formulas, but it can 
be checked very simply: we have merely to mark the literals of y as 
true and see whether # thereupon reduces to a tautology. Again, to 
see whether a literal ¢ is superfluous in ® V ¢y, we just check, in 
the same swift way, whether y implies ® V ¢. For 


(VIII) y tmplies  V ¢ af and only if ® V th 1s equivalent to 
® V y. 


For, the conditional y D.@ V ¢ is verifiably equivalent to the 
biconditional ® V ty .=.@ V y, and hence tautologous if and only 
if the biconditional is tautologous. 

Unhappily, however, such reduction of a formula to an irredun- 
dant equivalent need not deliver a shortest.5 What it delivers may 
well lack some of the clauses of every shortest normal equivalent, 
and contain clauses shared by no shortest normal equivalent. 

At the same time there commonly will be, given a formula ©, 
certain clauses that are bound to appear in every irredundant 
equivalent of @, and hence in any shortest. The alternation of such 
clauses I call the core of 6. Commonly also there will be, at the 
opposite extreme, prime implicants of ® that are bound never to 


6 E.g., ‘ng V pq V gf V Gr’ is irredundant but equivalent to ‘pg V gf V pr’. 
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occur as clauses of irredundant equivalents of &. For a prime im- 
plicant of & to be thus absolutely superfluous, it is obviously sufh- 
cient that it imply the core of @; but I do not know whether this is 
necessary.°® 

I shall explain a test whereby, without exhausting the prime 
implicants of a formula, we can identify its core. This done, we 
obviously can then quickly decide also, of a prime implicant, 
whether it implies the core (and is thus absolutely superfluous). 
Query: Given an irredundant formula, can all its absent prime 
implicants that are not absolutely superfluous be reached by the 
operation (ii) of iterated consensus-taking without interim reten- 
tion of any absolutely superfluous ones? This, if true, would allow 
us to shortcut the (A)-(B) approach to the extent of leaving some 
of the prime implicants ungenerated in some cases; but I do not 
know it to be true, and anyway the saving would be limited to 
formulas with cores. 

Guesses aside, there are cases where identifying the core of an 
irredundant formula settles the question of shortest normal equiva- 
lent without further ado. Thus if every clause belongs to the core, 
the formula is already as short as possible. If every clause but one 
belongs to the core, then again our simplification is good enough; 
the only possible improvement would be the negligible one of find- 
ing some shorter clause to take the place of the odd one. 

But in the general case a technique of core identification has, 
alas, no evident bearing on our central problem: that of finding 
shortest normal equivalents without exhausting prime implicants. 
It seems worth communicating mainly for what it may contribute 
to one’s understanding of the general workings of irredundant 
formulas. 

Criterion, given an irredundant formula %, of whether a clause ¢ 
thereof belongs to the core: From & delete ¢, also each clause that 
opposes ¢ in more than one letter, and finally all literals whose 
letters are in ¢; what remains will be a tautology if and only if ¢ 
does not belong to the core of &. Examples: ‘pq’ does not belong to 
the core of ‘pq V gs V qf V Gr’, for ‘s V = V rs’ is tautologous; 
whereas ‘gs’ belongs to the core, for ‘p V 7’ is not tautologous. 

That the criterion is geared only to irredundant formulas is no 


6 Note added 1965: It is not, as eight readers have proved to me. The first of 
the counterinstances came from William E. Glass under date of December 15, 
1959; namely “pF V gr V G8 V qs’, whose core ‘p¥ V gs V 8’ is implied by 
neither of two absolutely dispensable prime implicants, ‘pq’ and ‘ps’. 
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limitation, for we have seen how to make a formula irredundant. 
That the criterion enables us to spot as core clauses only clauses 
already present is again no limitation, for, by definition, in any 
irredundant formula all the core clauses are present. Actually 
uniliteral irredundancy would suffice here without clausal; but it 
is more efficient to start from a fully irredundant formula, since 
there are then fewer clauses to examine. 

It remains to justify the criterion by proving that a clause of an 
irredundant formula belongs to the core if and only if it meets the 
stated test. In other words, what is wanted is the theorem: 


(IX) A clause ¢ of an trredundant © belongs to the core of & if and 
only uf there 1s an assignment of truth values to letters not in ¢ that 
falsifies all clauses of &, other than ¢, that oppose ¢ in at most one 
letter. (I assume as usual that ® is not simply a tautology a V @.) 


Proof.” Let ® be any irredundant formula, ¢ any clause of @, and 
W the alternation of all prime implicants of @ but ¢. Then ¢ is in the 
core of @ if and only if W is not equivalent to ¢ V W (and thus to 
&); hence if and only if ¢ does not imply VY. So what is to be proved 
is (a) that if ¢ does not imply WV then some assignment to letters not 
in ¢ falsifies all clauses of , other than ¢, that oppose ¢ in at most 
one letter, and (b) that if ¢ implies YW then each assignment to 
letters not in ¢ is compatible with some clause of %, besides ¢, that 
opposes ¢ in at most one letter. 


Proof of (a). Since @ does not imply Y, some assignment A to the 
letters of ® verifies @ and falsifies ¥. Let A’ be the part of A that 
has to do with letters not in ¢, and let y be any clause of ® other 
than ¢ that opposes ¢ in at most one letter; to show that A’ falsifies 
y. Case 1. Y opposes ¢ in no letter. Then, since A verifies ¢, A 
falsifies no literals of y whose letters are in ¢; but A does falsify y, 
for y is clearly a clause of VY, and A falsifies Y; so A’ falsifies y. 
Case 2. w has just one literal, call it £, opposed to a literal of ¢. But 
y and ¢ are not simply ¢ and ¢, or © would be a tautology. So w and 
@ have a consensus, and it, since it implies ¥ V ¢ and therefore 4, 
subsumes a prime implicant x of ®. Since x lacks ¢, x 1s not ¢; 
hence x is a clause of WV; and hence A falsifies x. But all literals of x 
whose letters are in @ are identical with literals of ¢, and hence 
verified by A; so A’ falsifies at least one literal of x. But any such 
literal of x is a literal of y. So A’ falsifies y. 


7 Previously proved in one of my lectures on “Simplifying truth functions,” 
College of Engineering, University of Michigan, June 1958. 
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Proof of (b). Here the assumption is that ¢ implies VW; i.e., that ® 
and W are equivalent. Let A’ be any assignment to the letters not 
in ¢. Let ©’ and W’ be what © and W reduce to under A’; thus ®’ and 
W’ are equivalent. It will be sufficient to show that ©’ has a clause, 
other than ¢, that opposes ¢ in at most one letter. Now ¢, being a 
clause still of ©’, implies ®’ and therefore W’. Hence ¢ must sub- 
sume some clause yf’ of ¥’; for @ subsumes every clause of W’ not 
opposed to ¢, there being no further letters. This y’ is the residue 
in W’ of some clause y of WV. Since ¢ is not a clause of V, y is not ¢; 
moreover, since any clause of VW is a prime implicant of ®, » does 
not even subsume ¢. So y lacks a literal ¢ of 6; ¢ subsumes (y’. Now 
assign falsity to ¢ and truth to the rest of the literals of ¢. This 
assignment verifies ¥’, and therewith W’; hence also ©’, which is 
equivalent to v’. Therefore this assignment verifies some clause of 
&’, But a clause of &’, to be thus verified, must consist solely of 
literals of ¢ other than ¢, plus perhaps ¢£; hence it is other than ¢ and 
opposes ¢ in at most one letter. 


XV 


@t [wo Theorems about 
Truth Functions 


It will be convenient simply to disregard order in a conjunction, 
thus treating the fundamental formulas ‘p@r’, ‘prq’, ‘rgp’, etc., not 
merely as equivalents but as one and the same formula. From this 
point of view the fundamental formulas which contain all and only 
n given statement letters are just 2* in number. Given any n state- 
ment letters, listed in an arbitrary order ai, a2, ... , Qn, the 2” 
fundamental formulas containing those letters can be listed ex- 


haustively in a convenient standard order as follows: 


r a VG = ty = 1 
(1) ‘aya, . . . Gn’, ‘aide... AnuiGn', ‘102 . . . GnDeyoieee 

ie eal. Ts x a 

12 2. . « An—2Xn-1An y 1012 «1 » An—ZQn—2%n-1Xn , oe oy 

GG. . . a 


There is a quick 7mplication criterion for any two normal formulas 
® and Y such that W lacks negation signs; viz., implies V if and 
only if each clause of subsumes a clause of Y. That such subsump- 
tion is sufficient in order that ® imply W is seen as follows. Sup- 
pose each clause of & subsumes, and therefore implies, a clause of 
Y; then, since each clause of Y implies VY, each clause of ® implies 
WV; and accordingly © implies V. That the subsumption condition 


This paper was presented by invitation, but in absentia, at the Mexican Scien- 
tific Congress, September 1951, and appeared in the Boletin de la Sociedad 
Matematica Mexicana in 1953. I have been able to omit portions explanatory 
of terms and notation, thanks to foregoing papers. A corollary is inserted from 
the end of my paper ‘‘The problem of simplifying truth functions,’’ American 
Mathematical Monthly, 1952. 
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is also necessary is seen as follows. Suppose some clause ‘ajaz. . . . 
OmpiB2 . . . Bn'(m = 0,n = 0) of & subsumes no clause of W. This 
is the same as supposing (since V lacks negation signs) that every 
clause y; of Y contains a letter y; other than a, ... , am. If we 
assign truth to a, ... , &m and falsity to all other letters, then 
aide... OmBibs ... Bn! comes out true. (Note that the §’s 
repeat no a’s, by the definition of “fundamental formula’’.) There- 
fore ® comes out true. On the other hand y; for each 27 comes out 
false under the described assignment of truth values, on account 
of y; (for remember that WV lacks negation signs); so Y comes out 
false. Therefore © does not imply W. 

An obvious expedient for shortening normal formulae is that of 
simply deleting clauses which subsume other clauses (thus exploit- 
ing the familiar equivalence of ‘pg V p’ to ‘p’). Now it can be 
proved that this expedient always eventuates in a shortest normal 
equivalent if the normal formula with which we begin lacks nega- 
tion. Such is the content of 


THeoreM I. Jf a formula zs normal and lacks negation and none of 
ats clauses subsumes any other of tts clauses, then it has no shorter 
normal equivalent. 


Proof. Suppose (i) that W is a normal formula lacking negation, 
(ii) that no clause of Y subsumes another clause of V, and (iii) that 
® is normal and equivalent to VY; to prove that @ is no shorter than 
W. By (i) and (iii) and the above implication criterion, each clause 
of subsumes a clause of ¥. Let the thus subsumed clauses of VW be 
Yi, . . . , Yn. Since to subsume is to imply, each clause of ® im- 
plies one of yi, ..., wa; hence ® implies "hyiV... V Wal 
By (iii), then, Y implies '¥, V ... V wn! Then, since "yi V 

. V vn! lacks negation (by (i)), we can conclude from the 
implication criterion that every clause of Y subsumes a clause of 
Th, V ... Val By (ii), then, Y has no clauses but yi, .. . , 
¥,. Now assign truth to all letters of ¥:, for some 7, and falsity to 
all other letters. By (i), ¥; lacks negation and hence comes out 
true. Therefore Y comes out true (since truth of ¥; assures truth 
of '¥, V... V v,'). Therefore, by (iii), & comes out true. Hence 
some clause ¢ of ® comes out true. On the other hand each of 
Yi, ... , Wn other than y; contains a letter other than those of 
y:, by (ii), and hence comes out false under the given assignment. 
Therefore ¢, which comes out true, subsumes none of y1, ... , 
Yn, Other than y;. So, since every clause of © subsumes one or 
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another of ¥, . .. , wn, we must conclude that ¢ subsumes y; 
and none of the others. Applying this reasoning to each choice of 
z, we see that each of Wi, . . . , wa is subsumed by a clause of ® 
which subsumes no others of ~1, . . . , Ya. Therefore # is no shorter 
tan WV... Va) aed. 


CorouuaRyY. If a formula is normal and none of its clauses sub- 
sumes or conflicts with any other of tts clauses, then 1t has no shorter 
normal equivalent. 


For, there being no conflict, each letter occurs affirmatively 
only or negatively only. Read the negative literals as new affirma- 
tive letters and apply Theorem I. 

Preparatory to the other theorem which it is the business of 
this paper to prove, viz., Theorem II below, let us look back to 
the 2” fundamental formulas listed in (1). The alternation of the 
first 2 of those formulas will be called [2]. Clearly [2%] is valid, and 
hence has ‘p V 7’ as shortest normal equivalent. On the other 
hand 


THEOREM II. If m < 2* then [m] has as a shortest normal equivalent 
a formula which lacks negation. 


Proof. For each h up to n, the first 2*—* formulas of (1) exhaust 


the ways of distributing negation signs over ani, . . . , @n While 
keeping a, ... , a, affirmative. Clearly, therefore, 
(2) [2"-*) is equivalent to "a, .. . an! 


After the 2*-’th formula, the series (1) repeats as from the begin- 
ning but with a, negated. Thus, where 1 S 7 S 27~*, 


(8)  [29-* + 7] is [2"—*] in alternation with [7] with a, negated. 
Now let ii, . . . , hx be, in ascending order, the integers such that 
(4) Ne = Be ee a eee 


(They are all positive, since m < 2"; and they are distinct. To 
find them, write m in binary notation and count the places to the 
right of each occurrence of ‘1’. Each of hi, . . . , hy iS n Minus one 
of those counts.) By (4) and (3), [m] is [2"-"] in alternation with 
[Q>—A2 + | | | + 22-**) with an, negated. But, by (8) again, [2*- 
+... + 2-'*) in turn is [2*-2] in alternation with [2*-* + 

: 2"-*k] with a, negated; so [m] is the alternation of [2°—"], 
[29m with a,, negated, and [2°-** + ... + 2"-*] with a», and 
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a,, negated. Continuing thus, we finally find that [m] is the alterna- 
tion of [2°-4], [2-2] with a,, negated, [2°] with a,, and ay, 
negated, . . . , and (2"-**] with ay,, . . . , am,_, negated. But, by 
(2), (2*-] is equivalent to "a, . . . ap,'. Also, by (2), [2°] is 
equivalent to ‘a, ... a,,', and hence, since substitution for 
letters preserves equivalence, [2*~"?] with az, negated is equivalent 
to ‘ay... O%,-10%,0n41 . . . a, Continuing thus, we find [m] 
equivalent to 


mies... On, V on. . cen 1@m@nas . « . aay Vea. . . 
ge 1a . - » Riya p jg - . . Why VO. a Von... 
hy —10th Hh + + © Ahg—1OheAho tt «© + Why _y—1Ohg_1Chp_i tl 

. ea. 


Now the last two of the k clauses of (5) are related in the manner 
of ‘pq’ and ‘pgr’, with a,,_, In the role of ‘q’; and ‘pq V p@r’ is 
equivalent by truth tables to ‘pq V pr’. Hence the occurrence of 
"@n,-,' in (5) can be dropped. Again the last three clauses of the 
thus amended (5) are related in the manner of ‘pq’, ‘pGr’, and ‘pgs’, 
with a,_, in the role of ‘g’; and ‘pq V pgr V p§gs’ is equivalent to 
‘og V pr V ps’. Hence the two occurrences of '&,.' can be 
dropped. Continuing thus, we delete all negative literals from (5) 
and are left with 


r 
Ce Oi... Gh imal ©. ag V Ot Ps. ie ige 
° Aho—1%hy41 eo 2 @ hs V « ¢ « V Q1 o 8 6 hy —1@%h1 +41 e ee e 
4 7 

Oho—10%hot1 - - - Ahpy_y—10hp_jy4i - - - Any - 


But this lacks negation. Moreover, none of its clauses subsumes 
any other of its clauses; so, by Theorem I, there is no shorter 
normal equivalent. 


ey. 


at On Boolean Functions 


As auxiliaries to a concurrent study of the utility concept, two 
theorems of set theory are proving useful. The envisaged applica- 
tion of these theorems concerns only relations, or sets of ordered 
pairs, whereas the theorems hold for sets generally. Consequently 
the two theorems are most conveniently set forth separately from 
the eventual memorandum in which they are to be applied. 
Where F and G are sets, their intersect or common part will be 
represented as FG. Where Fi, .. . , Fn are sets, their union will 


be represented as » F;; this is the set whose members are all and 
w=l1 
only those objects each of which belongs to at least one of Fi, 
... , Fy. Where F is a set, its complement will be represented as 
F’; this is the set whose members are all and only the objects not 
belonging to F. Any set which is specifiable in terms of given sets 
F,, ... , Fn by means exclusively of intersect, union, and com- 
plement is called a Boolean function of Fi, ... , Fn. 
More particularly, the 2” intersects: 


FiPoFy...Fe, FiFoRs... Fn, FiFoFs... Fa, FiFiFs... Fay 
FIFF;...Fa FiFifs...Fa, ...». >, Se 


are called the constituent functions of F,, Fo, ... , Fn. For brev- 
ity they will be designated respectively as F!, F?,..., F?. 
Clearly 


(1) For every object x there is an 7 such that x is an element of F*. 


This paper is assembled from excerpts from two working memoranda issued 
under Project RAND in 1949. It is printed by permission of the RAND Cor- 
poration, Santa Monica, Calif. 
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1. Parametric Boolean Functions. Consider now any n-ary 
function ¢, Boolean or otherwise, which takes sets as arguments 
and as values. ¢ will be called a parameiric Boolean function if 


there are sets Hi, . . . , H, for somer, and an (n + r)-ary Boolean 
function y, such that 
(2) Forailsets i), ... , Fa, 

(Fi, o 8 , = (Aj, s 0 le, ay, oe ne a) 


Two sets will be said to agree in an object x when z belongs to 
both or to neither. An n-ary function ¢ of sets will be said to pre- 


serve agreement when this law holds: If Fi, ... , Fa, Gi, .. . , Gn 
are any sets, and x is any object in which F; and G; agree pairwise 
for each 7, then ¢(/1,...,F,n) and $(Gi, . .. , Gn) will agree 
m 


Now the purpose of this section is to establish 


THEOREM I. A function ts a parametric Boolean function tf and 
only uf it preserves agreement. 


Consider to begin with any parametric Boolean function ¢. Le., 
there are sets Hi, .. . , H, and a Boolean function y such that 
(2) holds. Now we want to show that ¢ preserves agreement. 
Le., given any object x such that, for each 7, F; agrees with G; 
in z, we want to show that ¥(Mi, ...,H,, Fi, ... , Fa) agrees 
with Yh, ..., H,, Gi, . . . , Ga) in x. But this is evident by 
the following consideration. Since Boolean functions are con- 
structed solely by intersect, union, and complement, member- 
ship of xin ¥(H,, ... , A,, Fi, . . . , Fn) will depend solely on 
membership or non-membership of z in the successive arguments 
Poe. ,t,, Ff, ...., Pa; so the change Of F;,..., Fa to 
Gi, . . - , Ga here will make no difference so far as x is concerned. 

It now remains only to prove the converse half of the theorem; 
viz., that any function which preserves agreement is a parametric 
Boolean function. 

Suppose then that @ preserves agreement. Let K; be the set of 
all and only those objects x for which there are sets Gi, . . . , Gn 
such that x belongs both to the constituent function G* and to ¢e¢, 
if we write ¢cg for (Gi, . . . , Gn). Note that K; depends only oni 
and ¢. By (1), every element of ¢r belongs, for some 7, to F*#r and 
hence to K; and hence to K;F'*. So 

On 
(3) or is a subset of) Rr. 


i=1 
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Next consider any z and 7 such that z belongs to K;F*. Then, by 
the definition of K;, there are G, . . . , G, such that z belongs to 
Gig. Now Fis the intersect of F; or F, with F, or F; and so on, 
and G‘ is similarly constituted with ‘G’ in place of ‘F’; hence z, be- 
longing as it does to F* and G‘, must, for each 7, belong to both 
F; and G; or to both F; and Gj. L.e., F; and G; agree in z for each 
j. Then, since ¢ preserves agreement, ¢r and ¢¢ agree in z. But z 
belongs to ¢g¢. Hence z belongs to or. For each 7, then, each element 


On 
z of K,F* belongs to ¢r. Therefore yy K;F* is a subset of or. 
i=l 
Hence, by (3), 
on 
(4) or = > kK iF % 


i=1 


But this summation is a Boolean function of Ki, ... , Ko", Fi, 
. .., Fp. Moreover, (4) has been seen to hold for all choices of 
Py, ...,F,.S0 ¢ is a parametric Boolean function. 


CoROLLARY: The number of the parameters of a parametric 


Boolean function of n variables need never exceed 2". This is evident 
from the use of Ky, . . . , Ke" in the above reasoning. 


2. Commutativity and Incidence. An n-ary Boolean function ¢ is 
called thoroughly commutative if 


(5) For all sets Fi, ..., Fa, and every permutation G),..., 
Gn of them, (fi, ..., Fn) = 6G, ... , Gn). 


Now the theorem which is to be established is one which affirms a 
correspondence between the thoroughly commutative Boolean 
n-ary functions and the classes of natural numbers Sn. 

Let us write I.(/1, . . . , Fn) for the incidence of the element x 
in the sets Fi, ... , Fn; i.e., for the number of sets from among 
F,, . .., F, to which x belongs. Now what is to be proved is 


THEOREM II. An n-ary function ¢, whose arguments and values 
are sets, ts a thoroughly commutative Boolean function if and only af 
there 1s a class N of natural numbers Sn such that 


(6) For all sets F,, . . . , Fn, and all objects x, x is an element of 
$(Fi, ... , Fa) if and only if LU, ... , Fn) belongs to WN. 
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It is well known that every Boolean function is expressible as a 
union of 20 constituent functions; i.e., for every n-ary Boolean 
function ¢ there are numbers pi, . . . , px S 2" (K = O) such that, 
femal}, ... , Fs, 


(7) Ml iccytts) = wie 


i=] 


For each 7 from 1 to 2”, let g,(¢) be the number of unbarred 
components in the constituent function F*. In other words, g,(2) 
is n minus the number of bars (indicating complement) in F’. 
Clearly g,(7) depends only on n and 7, and is independent of the 
choice of sets Fi, .. . , Fn. 

Any element x of F* is an element of all and only those of Fi, 
. . . , &, which appear as unbarred components of F*. Hence 


(8) If x is an element of F* then I.(Fi, .. . , Fn) = gn(2). 


Conversely, suppose I.,(Fi, ..., Fa) = ga(z). Then x is an 
element of some constituent function F’ which has just g,(z) un- 
barred components. But F’ is the same as F except for a permuta- 
von or f;,...,F,.S0 


(9) Tf L.(Fi, . . . , Pn) = ga(t) then x is an element of G‘ for some 
permutation Gi,...,G,of Fi, ..., Fn. 


We turn next to the proof of the forward half of Theorem II. 
Here we assume a Boolean function ¢ for which (5) holds, and seek 
a class N of natural numbers Sn for which (6) holds. 

N will be taken as the class of numbers gn(pi), . . - , 9n(Dz), 
where pi, . . . , Px are asin (7). So what is to be shown is that (6) 
holds for this choice of N. I.e., what is to be shown, for all sets 
Fy, ... ,F, and all objects z, is that 


(10) xz is an element of (Fi, ... , Fn) 
if and only if there is a j such that 
(11) LU, ...)Fa) = n(p;)- 


But if (10) holds, then by (7) there is a 7 such that z is an element 
of F?;; and accordingly (11) follows in view of (8). Conversely, if 
(11) holds, then by (9) there is a permutation Gi, ..., Gna of 
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Fy, ..., Fa such that x is an element of G” and hence of 

k 
vo G™, or (Gi, . . . , Gn); so (10) follows in view of (5). 

t=] 

We turn finally to the proof of the converse half of Theorem II. 
Here, adopting any class N of natural numbers Sn and assuming 
(6), what we have to show is that ¢ is a Boolean function fulfilling 
(5). 

Let qm, . . - , 9 be all the numbers 7 S 2* such that ga(z) be- 
longs to N. Consider any sets Fi, ..., Fn. By (8), then, any 
element x of F% will be such that I.(F1, ..., Fn) belongs to N. 
By (6), therefore, every element of F% belongs to ¢(f1,... , 
F,,). So 


h 
Cy) F% is a subset of @(F,, ... , Fa). 

Conversely, consider any element x of ¢(f1,..., Fr). By 
(6), 1.(Fi, . .. , /,) belongs to N. Also, by (1) and (8), there isa 
j such that x belongs to F’ and 1,(f1, ... , Fa) = gn(j). SO 7 is 

h 


one of q@, ..- , Qn, and hence zx belongs to 2; F%, Therefore 


t=] 


h 
o(F,, . .. , Fn) isa subset OE Hence, by (12), o(Fi, ... , 


i=1 


h 
Fi) = 3 for all Fi, ..., Fr. But this summation is a 
t=1 
Boolean function. Moreover it fulfills (5); this is seen as follows. 
Where F* is any result of permutation of F,,... , Ff, in F%, the 
number of bars in F’ is the same as in F'”; hence g(r) = g(q;); hence 
g(r) belongs to N; hence ris one of mi, . . . , gx; and hence F’”, like 
h 


F*%7, figures in yy | eae 


t=1 


mY it 


«t On the Logic of 


Quantification 


1. Predicates, Schemata, Matrices. The notation that I shall use 
here for the logic of quantification is of the familiar kind wherein 
the letters ‘p’, ‘g’, etc., stand in place of unspecified statements and 
the letters ‘f’, ‘g’, etc., stand in place of unspecified predicates. The 
present section will deal with the significance of this notation; the 
purpose and scope of the paper as a whole can better be indicated 
afterward, in §2. 

By “predicates” I mean, not properties (or classes) and relations, 
but merely certain notational expressions. As a first approach they 
may be thought of as expressions like ‘walks’, ‘is red’, ‘touches’, 
‘gives to’. Where ‘f’, ‘g’, ‘h’, and ‘k’ represent these four predicates, 
we may read ‘fx’, ‘gy’, ‘hry’, and ‘kxyz’ as ‘x walks’, ‘y is red’, ‘x 
touches y’, “x gives y to z’. 

Given any statement and any sequence of n(21) distinct names 
which occur in the statement, we want there to be a predicate 
whose application to the sequence yields the statement. Clearly, 
then, the above tentative version of predicates as mere verbs is too 
confining. We can, however, construe predicates satisfactorily as 
expressions of a somewhat more elaborate sort; namely, as expres- 
sions which are like statements except for containing circled nu- 
merals (an arbitrary device) in various places appropriate to 
names. For example, where ‘fry’ represents ‘x paid y more than 
Jones paid x’, the predicate represented by ‘f’ is taken to be ‘@ 


Reprinted from the Journal of Symbolic Logic, Volume 10, 1945. 
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paid @) more than Jones paid @’. Thus {’ followed by one or more 
variables represents the result of putting those successive variables 
respectively for ‘@)’, ‘@”’, etc., throughout the predicate represented 
bya 

The word ‘represent’, as used above, is intended in the sense of 
‘supplant’, ‘stand in the place of’; not in the sense of ‘refer to’, ‘take 
as value’. The letters ‘p’, ‘a’, ‘f’, ‘g’, etc., are thus not related to 
statements and predicates in the way in which the variables of 
ordinary algebra are related to numbers. Whereas the ‘z’, ‘y’, etc., 
of algebra behave as names of unspecified numbers, on the other 
hand ‘p’, ‘q’, ‘f’, ‘g’, etc., behave as unspecified statements and 
predicates outright, not as names of them. 

The analogy that does hold between ‘p’, ‘q’, ‘f’, ‘g’, etc., and the 
‘x’, ‘y’, ete., of algebra is rather this: the former supplant un- 
specified expressions of certain kinds, namely statements and 
predicates, while the variables of algebra supplant unspecified ex- 
pressions of another kind, namely numerals such as ‘5’, ‘7’ ete. But 
even of this analogy we must beware. It is safe so long as we do not 
think of the numerals as names of anything; but mathematics can- 
not be pushed far until we do. Implicit or explicit use of quantifica- 
tion over numbers (e.g., in the fashion ‘Whatever positive number 
xz may be, there is a number y such that .. .’) immediately 
commits us to recognizing a realm of numbers of which the nu- 
merals are names.” At this point the analogy of ‘p’, ‘q’, ‘f’, ‘g’, etc., 
to the variables of algebra breaks down, unless we care to assume 
also a realm of entities (truth values, classes, and relations?) of 
which statements and predicates are names. These entities, if 
assumed, would be values of the variables ‘p’, ‘q’, ‘f’, ‘g’, ete., just 
as numbers are values of the variables of algebra; but the assump- 
tion of such entities exceeds the scope of the logic of quantification, 
which has no need of them. 


1 Predicates in this sense were introduced in my Elementary Logic (1941), 
119, as an auxiliary to the theory of substitution. But I called them stencils, 
not perceiving how aptly the traditional term ‘predicate’ might be used at this 
point. 

2 See my “Designation and existence.”’ 

3 In higher logic, where quantification over classes and relations takes place, 
such entities have indeed to be assumed. Even there, however, I prefer riot to 
admit ‘f’, ‘g’, etc., into quantifiers, but to think of classes and relations rather 
as among the values of the regular variables of quantification ‘zx’, ‘y’, etc.; cf. 
my Mathematical Logic, §§22-23, and Elementary Logic (1941), §56. 

The so-called “algebra” or ‘‘calculus’’ of classes and that of relations, as well 
as rouch else that is commonly treated in class and relation theory, require no 
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So ‘p’, ‘a’, ‘f’, ‘g’, etc., are merely dummy letters, schematic 
letters, in place of which we may imagine any arbitrary state- 
' ments and predicates. Expressions constructed of such letters, e.g.: 


(1) G@)Gr > p) Galfz .D p, 


I call schemata; they are imaginary statements, diagrams of state- 
ments, and are distinguished from genuine statements in having 
dummy components such as ‘p’ and ‘fx’ in place of genuine com- 
ponents such as ‘birds fly’ and ‘z is red’.‘ 

The schemata of quantification theory may be specified thus: 
they comprise ‘p’, ‘q’, etc.; also ‘f’, ‘g’, etc., followed each by one or 
more occurrences of variables ‘x’, ‘y’, etc.; also everything that can 
be built up from these by means of truth functions and application 
of quantifiers ‘(x)’, ‘(y)’, ‘(4x)’, ‘(Ay)’, etc., provided that no one of 
the letters ‘f’, ‘g’, etc., occurs twice in a schema with different num- 
bers of occurrences of variables after it. 

Matrices are like statements except for containing free variables 
‘x’, ‘y’, etc., in places appropriate to names; e.g., ‘z is red’. Sche- 
mata differ fundamentally from these. Matrices are fragmentary 
expressions capable of occurring, overlaid by quantifiers, as integral 
parts of statements; schemata, on the other hand, cannot be com- 
pleted into statements, but are mere diagrams instrumental to the 
logical study of statements. 

Quantification theory can be presented, alternatively, in a meta- 
logical style in which we describe and treat statements without this 
diagrammatic appeal to schemata, and also without use of the 
notion of predicate.’ This metalogical method has advantages in 
some connections, but the method of schemata has a graphic 
quality which recommends it for general elementary purposes. 


2. Validity. Before proceeding further, a few technical terms 


quantification over classes and relations. We can, if we like, develop these 
portions of logic as a “virtual’’ theory of classes and relations which makes no 
real assumption of such entities and is wholly translatable into the present 
schematism of quantification theory; cf. my O Sentido da Nova Légica, 218- 
223, or indeed ‘‘A theory of classes presupposing no canons of type,’’ 325. 

* Whitehead and Russell’s use of ‘p’, ‘qg’, etc., and much of their use of their 
predicate letters ‘¢’, ‘y’, etc., can be construed as schematic in the above sense, 
though those authors do not distinguish between this and other interpretations. 
See §5 of I, above; also my ‘“‘Ontological remarks on the propositional calculus.”’ 
The doctrine of schemata is explicit in Cooley, 11, 75, and in my Elementary 
Logic (1941), 40, 90f., 116f., though in these two books the respective words 
‘form’ and ‘frame’ are used instead of ‘schema’. 

5 Such is my procedure in Mathematical Logic. 
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must be defined. A schema will be called n-adic if there are no more 
than n consecutive occurrences of variables® in it. Thus a monadic 
schema is one in which ‘f’, ‘g’, etc., occur only with single variables 
attached, and a medadic (0-adic) schema is one which is built up of 
just the letters ‘p’, ‘gq’, etc., by truth functions. A schema is called 
closed if it has no free variables,‘ and it is called a closure of another 
schema which has free variables if it is formed from the latter 
schema by prefixing universal quantifiers ‘(x)’, ‘(y)’, etc., corre- 
sponding to all the free variables. A statement obtainable from a 
closed schema by substitution for the schematic letters is called an 
instance of the schema. A closed schema all of whose instances are 
true is called valid; e.g., (1) above. Derivatively, schemata with 
free variables are called valid if they have valid closures. 

For the validity of medadic schemata we have a familiar deci- 
sion procedure in truth tables. The main business of quantification 
theory, thought of from the point of view of schematic presenta- 
tion, is to specify the valid schemata beyond the medadic stage. 
This may conveniently be done in two stages in turn: specifying 
the valid monadic schemata and the valid polyadic schemata. 

It has long been known that a decision procedure is possible for 
the validity of monadic schemata.’ This monadic domain embraces 
the bulk of what is familiar and most useful in quantification 
theory, including the theory of the syllogism. It is known, further- 
more, that no decision procedure is possible for the validity of 
polyadic schemata.’ It is therefore reasonable to segregate the 
monadic domain and take advantage of its amenability to a deci- 
sion procedure, thus deferring the inferior method of deductive 
proof to the polyadic domain which admits of nothing better. 
Actually the tendency in the literature has been rather to merge the 
monadic with the polyadic and use deductive proof throughout, 
probably because the known decision procedures for monadic 
schemata were rather cumbersome; the primary purpose of this 
paper, however, is to present a decision procedure for monadic 
schemata which seems convenient enough for practical and peda- 
gogical use. The procedure will be explained in §§3-4 and justified 
in §5. 


6 This refers only to variables ‘z’, ‘y’, etc., not to schematic letters. 

7 First shown by Léwenheim. For other decision procedures to the same pur- 
pose see Hilbert and Ackermann, 1938, 95-97; also the references to Skolem 
and Behmann there provided. 

8’ See XX, “‘Church’s theorem,’’ below. 
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Polyadic schemata can be handled afterward very simply. Let us 
call a schema monadically valid if it is obtainable by substitution in 
a valid monadic schema. Thus a monadically valid schema, though 
it need not be monadic, has a valid structure capable of depiction 
in a monadic schema. (Similarly I shall call a schema medadically 
valid if it is obtainable by substitution in a valid medadic schema; 
hence if it is valid by virtue of its truth-functional structure.) Now 
it turns out that all the valid polyadic schemata can be derived 
from the monadically valid schemata by repeated use of this one 
simple rule of generalized modus ponens: If a conditional ts valid, and 
ats antecedent consists of zero or more quantifiers followed by a valid 
schema, then its consequent is valid. Diagrammatically: 


from ‘(2)(%2) . . . (%a)(----) D ——’ and ‘---~-’ infer ‘ 


The rule amounts to a convenient operation of bracketing out, for 
deletion, the proved antecedent (including added quantifiers) of a 
proved conditional. An example of its use, and a proof of the ade- 
quacy just now claimed for it, will appear in §7. 


) 


8. Reduction to Basic Schemata. A schema will be called a basic 
quantification if it consists of a universal quantifier (say ‘(x)’) fol- 
lowed by a truth function built up of components each of which 
consists of a predicate letter and a single occurrence of the variable 
of quantification (thus ‘fz’, ‘gx’, etc.). A schema will be called basic, 
more generally, if it is a truth function all of whose components are 
statement-letters (‘p’, ‘q’, etc.) or basic quantifications. Equiva- 
lently: a basic schema is a closed monadic schema wherein all 
quantifications are basic quantifications. 

In §4 a method will be explained for testing the validity of basic 
schemata. In the present section, meanwhile, we shall see how to 
transform any closed monadic schema into an equivalent which is 
basic. The combined techniques of the two sections then enable us 
to test any monadic schema for validity, simply by transforming its 
closure into a basic equivalent and testing the latter for validity. 

Consider, to begin with, a quantification ‘(z)(. . . .)’ where 
‘... . 1s a truth function of components at least one of which is 
devoid of free occurrences of ‘x’. Select such a component, say ‘p’. 
(Instead of ‘p’ it could just as well be ‘fy’, or ‘(y)(fy . ~fz)’, ete.) 
Find, by truth-function theory, the two expressions to which 
‘. . . .” reduces on the respective assumptions that ‘p’ is true and 
that ‘p’ is false. (An easy graphic way is to put the appropriate 
truth-value letter, “I’ or ‘F’, for ‘p’ throughout ‘. . . .’ and then 
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progressively reduce ‘~T’ to ‘F’, ‘~F’ to ‘T’, ‘T. fx’ to ‘fz’, 
‘F . fx’ to ‘F’, ‘fr > F’ to ‘~fx’, and so on.) Where the two ex- 


pressions to which ‘. . . .’ thus reduces are respectively ‘---~-’ and 
‘ ’, our original quantification ‘(z)(. . . .)’ can now be given 
the equivalent form: 

(2) pe GQ\G== eV > ap iG) a 


If it should happen that ‘----’ or ‘ 7 is simply ‘T’ or ‘F’, (2) 
reduces further to one or another of these: 


Qiew Qo) pl @OC--), =. OO), eee 
If both ‘----’ and ‘ ’ are ‘T’ or ‘F’, (2) reduces to one or another 
of: 

(4) oy D PP: ~h Pee 


We thus have a general method for exporting ‘p’ from the quantifi- 
cation; namely, transformation of ‘(7)(. . . . )’ into (2), or into its 
simpler equivalent from among (8)-(4). 

Now the method of transforming any closed monadic schema ¢ 
into a basic equivalent consists merely in continued use of the above 
exportation technique, starting in the innermost quantifications of 
@. (Any occurrences of ‘(3x)’, ‘(ay)’, etc., are first translated into 
‘~(x)~’, ‘~(y)~’, etc., so that we have only universal quantifica- 
tions.) 

For example, let us transform the closure of the monadic schema: 


(5) (z)(fc. fy .D gz). p. (a)fz .D gy, 
namely: 
(6) (y) (2) Ga . fy .D gz) .p. (2) fx .D gy), 


into a basic equivalent. Exportation of ‘fy’ from ‘(x)(fz .fy .D gx)’ 
gives ‘fy _ (x)(fx D gx)’ (answering to the second form in (8)), so 
that (6) becomes: 


(7) (y) fy > (x) (fz D gx) .p. fe . Digy). 
Exportation of “p . (x)fx’ from (7) gives: 
(8) p.@fze.D Wy TD OG Dae @)- 
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Exportation of ‘(x)(fz D gx)’ from the consequent of (8) gives: 
(x) (fx D gx). gy .V. ~(a) (fx D gx) . Y)(~fy D gy) 
(corresponding to (2)), so that (8) becomes the basic schema: 


(Op .(a)jz .D: 
@)Ge> 92). Hoy.V.~WGt D ge) oe) (~ De. 


As in the above example, so in general, the continued process of 
exportation beginning in innermost quantifications will always 
turn a closed monadic schema ¢ eventually into a basic schema y. 
This is seen as follows. Since ¢ is monadic, any innermost quantifi- 
cation in it must consist of a quantifier followed by a truth function 
whose components are of at most these three kinds: (i) a predicate 
letter followed by the variable of quantification, (ii) a predicate 
letter followed by a variable other than that of the quantification, 
(iii) a statement letter. But components of kinds (ii) and (iii), lack- 
ing as they do the variable of quantification, can all be exported. In 
this way any innermost quantification in ¢ gives way to a schema 
involving none but basic quantifications. So now the next broader 
quantifications come to consist each of a quantifier followed by a 
truth function whose components are of at most four kinds: (i), 
(ii), (iii), and basic quantifications. But components of kinds (ii) 
and (iii) can be exported as before; and so also can the basic quanti- 
fications, since they are closed and therefore have no free occur- 
rences of the variable of the broader quantification. In this way, 
progressively broader quantifications give way to schemata involv- 
ing none but basic quantifications, until finally ¢ is transformed 
into an equivalent y in which all quantifications are basic. More- 
over, w will, like ¢, be monadic and closed, since the intervening 
process of exportation cannot give rise to any polyadic ingredient 
or free variable not present in ¢. Thus y is basic. 


4. Validity Test for Basic Schemata. The validity test for basic 
schemata which is to be presented consists essentially of a certain 
combination of the following familiar techniques of truth-function 
theory: 

(i) Constructing a truth table under a complex truth function of 
given components to see which assignments of truth values to the 
components would falsify the compound. 

(ii) Testing for medadic (or truth-functional) validity or con- 
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travalidity. This is done by setting up a table as in (i) and seeing 
whether none or all of the assignments falsify the compound. 

(iii) Testing to see whether one schema, x, medadically implies 
another, w. This is done by setting up a table and seeing whether 
every assignment of truth values which verifies x verifies w.° 

Now preparatory to testing basic schemata for validity, all their 
variables are to be rewritten uniformly as ‘x’. For example, (9) 
becomes: 


(10) p. (x)fz.D: 
(x) (fc _-) gz) . (alge .V. ~@) fz > gz). ae Daa 


Such relettering will never engender conflict, since in a basic 
schema all occurrences of variables are bound to quantifiers with 
non-overlapping scopes. 

For any basic schema y, thus relettered, the validity test is as 
follows. First set up a truth table under y, assigning “T’s and ‘F’s 
in all combinations to the statement letters and the quantifications 
of y and calculating, for each row of the table, the value of y. Keep 
only the rows that yield ‘F’ for y. (If there are none, so that y is 
medadically valid, the test is already at an end.) Then determine 
whether each of these rows meets at least one of the following condi- 
tions: 

(a) It assigns ‘F’ to a quantification whose scope is medadically 
valid. 

(b) It assigns “T’ to one or more quantifications whose scope, or 
the conjunction of whose scopes, is medadically contravalid. 

(c) The scope, or conjunction of scopes, of one or more quanti- 
fications assigned “T’ medadically implies the scope of a quantifica- 
tion assigned ‘F’. 

As soon as a row is found to meet one of the conditions (a)-(c), 
we can drop it and go on to the next row. As soon as a row is found 
to meet none of (a)—(c), we can stop altogether, having found y non- 
valid. If every row meets one of the conditions (a)—(c), w is valid. 

The only reason for including (a) and (b) in the routine is to 
provide for rows where no quantification is marked “T’ or none is 
marked ‘F’ ; for otherwise (a) and (b) are merely extreme cases of (c). 

That the test affords a sufficient condition of validity—i.e., that 


® Concerning this test there is the following suggestion in O Sentido da Nova 
Légica, 65f: If x is simpler than w, work out the table for x and then extend the 
table to w in just those lines where x came out true. If w is simpler, work out the 
table for w and then extend it to x in just those lines where w came out false. 
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no basic schema y passing the test has a false instance ¥°—is im- 
mediately apparent: every assortment of truth-values for the 
quantifications of ¥° which could conduce to falsehood of y is 
precluded by (a), (b), or (c). That the test also affords a necessary 
condition of validity will be proved in §5. 

Meanwhile let us note some practical shortcuts. Ordinarily it 
will not be necessary to construct a full truth table under y. In 
practice we can, by inspection, usually sidestep most or all of the 
rows that verify ¥, rather than working them out and dropping 
them afterward. In the case of the conditional (10), for example, 
we simply reflect that the only assignments of truth values capable 
of falsifying the whole must be such as to verify the antecedent 
(hence assignment of ‘T’ to ‘p’ and ‘(x)fz’) and falsify both alter- 
natives of the consequent. Thus the only assignments that make 
(10) come out false are those shown in the four rows below: 


p. (x)fx .D: (x) (fa D gx). (x)gz .V.~(x) G2 D gx) . (2) (~fz D gx) 
a. oT A F a 
oT 1 F F 
ae F T F 
1 F F F 


(The value assigned to each quantification is marked under the 
quantifier, and a quantification occurring twice is marked at its 
first occurrence.) Finally, moving to (a)—(c), we can usually spot 
the desired case of medadic contravalidity or validity or implica- 
tion by inspection, without auxiliary tables, if it exists at all. In the 
above example, the fulfillment of (c) by the first two lines is recog- 
nized simply by noting that the conjunction of the scopes ‘fx’ and 
‘fx _) gx’ medadically implies ‘gx’; and the fulfillment of (c) by the 
remaining two lines is recognized by noting that ‘fx’ medadically 
implies ‘~fx > gz’. (10) is valid; hence also (5). 


5. Proof That Passing the Test Is Necessary for Validity. It is 
now to be proved that the test gives a necessary condition of valid- 
ity of basic schemata. I.e., it is to be proved that every basic 
schema failing the test has a false instance. Suppose, then, that w is 
a basic schema failing the test, so that there is an assignment A of 
“T’s and ‘F’s to the statement letters and quantifications of y which 
yields ‘F’ for y as a whole and yet conforms to none of (a)-(c). 
Certain substitutions upon the schematic letters of w will be speci- 
fied, for forming an instance ¥°; and it will be shown that these 
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substitutions turn the statement letters and quantifications of y 
into statements whose truth values realize A, so that y is false. 

‘0 = 0’, to begin with, is to be substituted for those statement 
letters of Y which are marked ‘T”’ in A, and ‘0 # 0’ for those which 
are marked ‘F’. Clearly these substitutions realize A so far as they 
go. It remains only to find matrices to substitute for the ‘fz’, ‘gz’, 
etc., of Y which will cause the quantifications also to realize A. 

Let us suppose there are k predicate letters in ¥, and think of 
them notas 7’, ‘g’, etc., butvas ‘fi, f.’, . . . “fr. Letina, x ee 
Xm (m 2 0) be the scopes of those quantifications in y which are 
marked “T’ in A, and let a, we, . . . , wn (n 2 O) be the scopes of 
the others. 

Case 1:n = 0. Let Ao be an assignment of ‘T’s and ‘F’s to ‘f,2’, 
‘fox’, . . . , ‘fgx’ Which makes all of x1, x2, . . . , Xm turn out as 
“T’. (There is such an Apo, in view of the fact that (b) fails for A.) 
Now the matrices substituted for ‘f,2’, ‘for’, . . . , ‘fxa’ in forming 
y° from yw are to be ‘x = 2’ and ‘x ~ x’, as follows: for such of 
‘fix’, ‘fox’, . . . , feu’ aS are marked ‘T’ by Ao, we substitute 
‘x = x’; for the others, ‘x ¥ x’. Since this turns x1, x2, . . . , Xm 
into matrices which hold true for all values of ‘z’, all the quantifica- 
tions in y° are true and hence realize A. 

Case 2:n > 0. Let Ai, for each 7 from 1 to n, be a certain assign- 
ment of “T’s and ‘F’s to ‘fix’, ‘for’, . . . , fxx’ causing all of x1, x2, 
. ss )Xm(m 2 0) to come out as ‘T’ and w,; as ‘F’. (There is such an 
A,, for each 2, in view of the fact that (a) and (c) fail for A.) Now 
the matrix to be substituted for ‘f,;x’, for each 7 from 1 to k, will be 
abbreviated as ‘f;°x’ and is to be determined as follows. Consider 


those assignments, if any, from among Ai, ..., Ana, in which 
‘f;x’ 1s assigned “T’. If there are none, ‘fz’ is to mean ‘x ¥ 2’. If 
there are some, say Aa,,Ab;, . . . , but A, is not among them, then 
‘fx’ is to mean: 
(11) z=a;.V.2=0;.V © S cctene 
If finally A, is among them, then ‘fz’ is to mean: 
Gaya — a;.V.2 = b;.V... 2 ae So ee 

ZNO ee... Ln 


So now y” has been specified for Case 2, and it remains only to show 
that its quantifications realize A. 

Note first that, for each 7 from 1 to n and each j from 1 to k, the 
statement ‘fz’ has the truth value which is assigned to ‘f;z’ by A;. 
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For, if that truth value is ‘T’, then ‘f;°v’ is according to the above 
specifications a truth of the form: 

i=a;.V.t=);.V..... VS... 


If on the other hand that truth value is ‘F’, then ‘f,’ is a falsehood 
of one of the three forms: 


1 #1, 

ae, .\/.t = b;.V.... (wheres is not onewof a;, bj, .. . ), 

mae. 2 = 0;.V.... .V. 6H a Vat Al. FZ... Fm 

(where 7 is not one of a;, b;,. . . , n butisoneof 1,2, ... ,n). 
Now let x9, . . . » Xm°, wi®, . . . , @a’ be the matrices in 7, 

corresponding to the x1, . . . , Xm, 1, . . . , @n of y. Since A; was 

an assignment of ‘T’s and ‘F’s to ‘fiw’, . . . , ‘fx’ causing w; to come 


out as ‘F’, it follows from the preceding paragraph that the number 
i, as value of ‘x’, fails to satisfy the matrix w,°. Each of the matrices 
wy, . . . , Wn? thus fails for at least one value of ‘x’. The quantifica- 
tions ‘(z)(. . . . )’ that have the scopes a°, . . . , wn? are there- 
fore all false. 

Similarly, since A; was an assignment causing all of 11, ... , 
Xm to come out as “T”’, it follows that 7 as value of ‘w’ fulfills all the 
matrices x1°, . . . , Xm°. This is true for each integer 7 from 1 to n. 
Moreover, any value of ‘x’ other than integers 1 to n will fulfill 
x1, - . - , Xm° if m does; for, x7", . . . , xm° are truth functions of 
‘fon’, . . . , fou’, and every object other than 1, ... , n fulfills 
exactly those of ‘fi°w’, . . . , ‘fi°x’ which n fulfills (viz., those of the 
form (12)). Hence x... ., xm are fulfilled not only by the 
integers | to n but by any other object as well. Thus the quantifica- 
tions ‘(z)(. .. .)’ having the scopes x)°,..., xm’ are true. 
Since those having the scopes w)°, . . . , wa? were seen to be false, 
the proof is complete that the quantifications in y realize A. 


6. Illustrations. All the monadic schemata which ordinarily ap- 
pear in the books can be put through the present testing routine 
quickly and easily. Let us begin with some which are not basic. 


(A) (ie Dig. 
Since this has a free variable, we take its closure ‘(y)((x)fx D fy)’. 
Exportation of ‘(x)fx’ by the method of §3 turns this into ‘(x)fx D 


(y)fy’. Relettering to ‘x’ as at the beginning of §4, we have ‘(x)fx D 
(x)fx’, which is medadically valid. Thus (A) has been found valid 
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through the mere reduction process, without use of the testing 
method of §4. 
The same thing happens with: 


(B) fy D (x)fz, 


only here we have to translate ‘(4x)’ into ‘~(x2)~’. Forming the 

closure ‘(y)(fy D ~(x) ~ fz)’ and exporting ‘(x) ~ fz’, we have 

‘(a) ~ fx D (y) ~ fy’, which, relettered to ‘x’, is medadically valid. 
Two more examples: 


(C) (x)(p D fx) D. p D (a) fz. 


Exportation of ‘p’ from the antecedent turns (C) into the medad- 
ically valid basic schema: 


p> @ife Dap ee 
so that again no use is required of the testing method of §4. 


(D) (x) (fx D p) D. Ax)fx D p. 


Translating ‘(Jz)’ into ‘~(x)~”’ as usual, and exporting ‘p’ from 
the antecedent, we have the medadically valid basic schema: 


pV (0) ~ fr. Se) i oe 


Biconditionals corresponding to the conditionals (C) and (D) prove 
valid, of course, in similar fashion. 

The validity of (A) and (B) is affirmed in effect by the meta- 
theorems *104 and *134 of Mathematical Logic (1940). These meta- 
theorems are two among thirteen in that book which affirm, in 
effect, the validity of certain non-basic monadic schemata.!® All 
thirteen schemata turn out to reduce, like (A)—(D), to medadically 
valid basic schemata, so that no use of the testing method of §4 
has to be made in connection with any of them. 

The rest of the valid monadic schemata which ordinarily appear 
in the books are already basic and hence can be tested directly by 
the method of §4. Four examples will be set forth. Under each 
schema those truth-value assignments are shown which render the 
whole false; they may be presumed discovered by rapid inspection. 


(E) (x) (fa D gx) D. (x)fx D (x)gx 
a a Fr 


10 The other eleven are *103, *110, *118, *135, *137, *157-162. 
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To test this we observe that ‘fr > gx. fx’ medadically implies 
‘gx’, fulfilling (c) of §4. 


(F) (x) (fx. gx) =. Gfx. @)gz 
4) T F 
i. es | 
ci F F 
F LY 


To test this we observe that the first three rows all fulfill (c), since 
‘fa . gx’ medadically implies ‘gz’ and ‘fx’; and also that the fourth 
row fulfills (c), since ‘fz . gx’ medadically implies ‘fz . gz’. 


(G) (x) (fa D gx) . (x)(ge D ha) .D (x) (fx D he) 
a J F 


To test this, which is the principle of the syllogism, we observe that 
“fa > gx .gx > hx’ medadically implies ‘fz D hz’, fulfilling (c). 


(H) (x)fr D ~(x) ~ fx 
a a 


To test this schema (in which ‘(az)’ has been translated into 
‘~(x)~’) we observe that ‘fx.~fz’ is medadically contravalid, 
fulfilling (b). 

The reader will find that all further basic schemata corresponding 
to metatheorems of Mathematical Logic come out as briefly as 
the above four. 


7. Polyadic Theory. To derive any further valid schema of 
quantification theory, we start with monadically valid ones and 
use the rule of generalized modus ponens (§2). A convenient nota- 
tion is this: Write down the required monadically valid schemata, 
justifying each by a reference to the valid monadic schema from 
which it is got by substitution; use reference numerals as abbrevia- 
tions for any recurrences of valid schemata which have already 
been numbered; and use brackets to indicate suppression of an 
antecedent by generalized modus ponens.'? Example: 


(I) (Ax) (y)fay D (y) Ax) fay 
Proof: From (B): fay D (Ax) fry (1) 
From (E): — [(y)1 D.](y)fay D (y) Aa) fry (2) 


From (D): (Gye >t 


“a Wamiely, *115, *130, *131, *141—*156. 
12This notation is already familiar to readers of Mathematical Logic; see 
92, 94. 
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In theory there is no need to recognize, over and above generalized 
modus ponens and ordinary substitution, a third operation having 
to do with relettering of bound variables; for, the effect of such 
relettering can always be gained by choosing differently lettered 
monadic schemata in the first place. The citation ‘(E)’ in the 
above proof, for example, may be understood as referring not to 
(E) itself but to the different valid monadic schema: 


yy D gy) D. Y~fy D gy. 


The above example (I) is the polyadic schema corresponding to 
the metatheorem *139 of Mathematical Logic. It will now be 
proved that every valid schema can be derived in similar fashion. 
To prove this it will be sufficient to show that all schemata de- 
monstrable in the system of Hilbert and Ackermann"—which I 
shall call H—can be obtained from monadically valid schemata by 
generalized modus ponens. For Hilbert and Ackermann prove in 
turn that every valid schema is demonstrable in H.4 

H has by way of axioms the monadic schemata (A) and (B), 
above, and various valid medadic schemata. The rules of inference 
are these: 

(a) Substitution. 

(8) Simple modus ponens. 

(y1) In a conditional schema containing ‘zx’ as free variable in 
the consequent only, insert the quantifier ‘(x)’ to govern the con- 
sequent. 

(y2) In a conditional schema containing ‘x’ as free variable in 
the antecedent only, insert ‘(av)’ to govern the antecedent. 

(6) Relettering of bound variables. 

Let us liberalize (v1) and (72) to the extent of allowing them to 
apply not just to the specific letter ‘x’ but equally to ‘y’, ‘z’, ete. 
Let us now call them (y’1) and (y’2) and eall the system A’. 

Anything demonstrable in H’ is demonstrable in such a way 
that all use of (a) and (6) precedes all use of (8), (y’1), and (y’2). 
For, consider any proof involving substitution and relettering in- 
termediately; we can trace the proof back to its starting point in 
the axioms, do the substituting and relettering rather in these, 
and then reproduce the old proof step by step carrying along the 
substituted expressions and new variables in their proper places, 
since any use of (8), (y‘1), and (y’2) along the way will apply just 


18 1938, 56-57. 
14 Tbid., 74-81. (After Gédel.) 
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as well to the substituted and relettered material. 

Hence whatever is demonstrable in H’ (or, a fortiori, in H) is 
derivable from monadically valid schemata by (8), (y‘1), and 
(y’2). This follows from the foregoing paragraph in view of the 
fact that the axioms are valid monadic schemata and everything 
got from them by (a) and (6) is monadically valid. 

But anything derivable from monadically valid schemata by 
(8), (y’1), and (y’2) is also derivable from monadically valid 
schemata by generalized modus ponens; for, (8) is a special case 
of generalized modus ponens, and (y‘1) and (7’2) yield no more 
than can be got with help of (C) or (D) as in the last step of the 
proof of (I) above. 


xy ITT 


« A Proof Procedure for 
Quantification Theory 


The purpose of this paper is to present and justify a simple proof 
procedure for quantification theory. The procedure will take the 
form of a method for proving a quantificational schema to be zncon- 
sistent, i.e., satisfiable in no non-empty universe. But it serves 
equally for proving validity, since we can show a schema valid by 
showing its negation inconsistent. 

Method A, as I shall call it, will appear first, followed by a more 
practical adaptation which I shall call B. The soundness and com- 
pleteness of A will be established, and the equivalence of A and B. 
Method A is from Skolem, 1928. 

The reader need be conversant with little more than the fairly 
conventional use of such terms as ‘quantificational schema’, ‘in- 
terpretation’, ‘valid’, ‘consistent’, ‘prenex’, and my notation of 
quasi-quotation.} 


1. Method A. Where ¢ is any prenex quantificational schema, 
its functional normal form (as I shall call it) is got from it by delet- 
ing all existential quantifiers and attaching subscripts to the re- 
currences of their variables, according to the following scheme: if 
the universal quantifiers to the left of "(38) ‘are "(a1)', . . . ,"(a@n)! 
in that order, then ' (48) 'is dropped and B is rewritten in the sequel 
as 'Ba,.++a,'. No meaning need be associated with the subscript 
notation. 

E.g., the prenex schema: 

Reprinted from the Journal of Symbolic Logic, Volume 20, 1955. Historical re- 


marks have been improved thanks to Dreben. 
1See VII and XVII above. ‘Prenex’ means that the quantifiers are initial. 
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(1) (Ay) (2) w) (2) [Fey D —(Faz. Fer). —(Frw. Fw) D Fry] 
has this as its functional normal form: 


(2) (x)(z) [Fry D —(F2z. Pex). —(Fxrw,. Fw.) D Fry). 


Given now any functional normal form y, consider the following 
class C of terms. C' is to contain, to begin with, all those free varia- 
bles of y which carry no subscripts (or ‘a’, arbitrarily, if there are 
none). Further, if a letter occurs with subscripts in y, then that 
same letter, with members of C in place of all its subscripts, is to 
belong in turn to C. This class C, usually infinite, will be called the 
lexicon of y. Thus the lexicon of (2) comprises ‘y’, ‘w,’, ‘wu,’, and 
so on. (Where the iteration of subscripts threatens to get out of 
hand, we may conveniently switch to a linear notation: ‘[wy]’ for 
‘wy’, ‘[w[wy]]’ for ‘ww,’, ete.) 

By a lexical instance of a functional normal form y will be meant 
any result of dropping the quantifiers of y (which are all universal) 
and substituting, for the recurrences of the variables of those 
quantifiers, terms from the lexicon of y. (The substitutions must 
reach all occurrences of the variables, not omitting occurrences 
within subscripts.) Here, e.g., are two lexical instances of (2): 


(3) Fyy D —(Fyy . Fyy) .—(Fyw, . Puy) D Fyy, 
4) Fw,y D —(Fwy . Fyw,) . —(Fu,jwo, . Pwy,w,) D Fuy,y. 


Now what I call Method A, for proving the inconsistency of a 
quantificational schema 4, is as follows: put ¢ into prenex form,? 
then get the functional normal form, and finally present a truth- 
functionally inconsistent lexical instance or conjunction of lexical 
instances. 

E.g., suppose we want to show the inconsistency of: 


(Jy) (a) [Fry = — (a2) (Fz. Fezx)|. 
Expanded, this becomes: 
(5) (ay)(a)[Fary D — (2) (Faz. Fer). —€z)(F2z. F2x) D Fry), 


and thereupon goes into the prenex equivalent (1). But the func- 
tional normal form (2) of (1) has the lexical instances (3) and (4), 
which are together inconsistent by truth table. 
2 This well-known transformation is explained, e.g., in Methods of Logic, 227. 
8 When ‘F’ is taken as ‘e’, this is the second paradox of a series which begins 


with Russell’s. See my Mathematical Logic, 128f.; also ‘‘On the theory of types,’’ 
1271. 
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It will be shown in §§4-5 that Method A is sound and complete. 
Meanwhile let us turn to a convenient variant of A, which I shall 
call B. 


2, Method B. Under this method we show the inconsistency of a 
conjunction of prenex schemata without first making a single pre- 
nex schema of it. For the purpose of formulating this method, I 
shall need to speak of the “lexicon” not of a single functional 
normal form y, as above, but of a conjunction 'y, . . . ¥,'of func- 
tional normal forms. The lexicon of "y, . .. ¥,!1is to contain all 
variables which occur free and without subscripts in any of yw, 

. , Wa (or ‘a’ if there are none); further, if a letter occurs with 
subscripts in any of ¥,... , Ww, then it with members of the 
lexicon in place of its subscripts is to belong in turn to the lexicon. 

E.g., the conjunction of: 


(6) GG) (FON . Baten Pony VV MPa Tage): 
(7) (x)(2) —(Fy . Fee), 


has as its lexicon the class whose members are ‘z’ and, for all mem- 
bers ¢ and 7 of the class, "y'and 'z,'. Note that this lexicon is not 
simply the sum of the respective lexica of (6), (7), and (8). 

By a lexical instance of a conjunction 'y, . . . ¥,! of functional 
normal forms will be meant any expression obtained from any of 
vi, . . . , Yn by dropping the quantifiers and substituting, for the 
recurrences of their variables, terms from the lexicon of "Wy, .. . Wa! 

The conjunction of (6)-(8), e.g., has the following among its 
lexical instances: 


(9) Bays Via Xhepegs ER tlog V2 Tae ice 
(10) — (FY s2ry, + F2sy,0), 
(11) ays, 
(12) — Fr22y,. 
Now Method B, for proving the inconsistency of a conjunction 
"di... on! Of quantificational schemata, is as follows. Put 4, 
. , n into prenex normal forms ¢;, . . . , ¢'n, so choosing the 


variables that none occurs both free and in an existential quanti- 
fier, nor twice in existential quantifiers. Then get the respective 
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functional normal forms, .. . ,%n of $1, . . . , $n, and finally 
present a truth-functionally inconsistent lexical instance or con- 
junction of lexical instances of "Yi. . . wa’. 

E.g., to prove the inconsistency of the conjunction of: 
(13) (z)(y) (Pay V 2)(az . Fay .V. Byeud2n)|, 
(14) — (4x) (y) Gz) (Fy2 . Fax), 
(15) (ax) — (ay) Fry, 


we first put (13)—(15) into prenex form thus: 
(x) (y) (Az) (Fey V. Fuz. Fey .V. Fyz. Fez), 
(x) (Ay) (z) —(Fyz . Fea), 
(dx) (y) — Fay, 


then obtain the corresponding functional normal forms (6)-(8), 
and finally present the truth-functionally inconsistent conjunction 
of lexical instances (9)—(12). 

To show that a conclusion follows from given premises, we have 
merely to show by Method B that the premises and the negation 
of the conclusion are inconsistent. E.g., to show that ‘(3z) (y) (az) 
(Fyz . Fzx)’ follows from (18) and (15), we show (13), (14), and 
(15) jointly inconsistent as above. 


8. Two Strategies. For a second example of Method B, let us 
show that transitivity and irreflexivity: 


(16) (x)(y) (2) (Fay . Fyz.D Fzz), 
(17) (x) —Faxx 


together imply asymmetry: (x)(y)(Fry D —Fyz). I.e., what is to 
be shown is the joint inconsistency of (16), (17), and ‘—(2)(y) 
(Fzy D —Fyzx)’. This last goes over into prenex form as: 


(18) (42) Gy) — (ry >. — yz). 


The functional normal forms of (16) and (17) are (16) and (17) 

themselves; that of (18) is ‘—(Fxy D —Fyz)’. The lexicon com- 

prises just ‘z’ and ‘y’. Finally we present three lexical instances: 
Pcie Tye 5 el ve, —F rz, — (Fry D —Fyz), 


which are, in conjunction, truth-functionally inconsistent. 
As this example illustrates, a finite lexicon is assured when none 


— 
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of our prenex schemata has a universal quantifier to the left of an 
existential one. When the lexicon is finite, the number of lexical 
instances is likewise finite; and then an actual decision procedure 
for inconsistency is at hand, since we can subject the conjunction 
of all lexical instances to a truth-table test.4 In turning schemata 
into prenex form it is therefore advantageous to bring existential 
quantifiers out ahead of universal ones, insofar as we have a choice. 
This policy is advantageous even when we cannot get all the ex- 
istential quantifiers out in front; for in any event we minimize the 
influx of subscripts. 

An advantage of Method B over Method A is that any given ex- 
istential quantifier tends, under Method B, to be preceded by 
fewer universal quantifiers than under Method A. A more obvious 
advantage of Method B is that it reduces the complexity of the 
individual lexical instances. A useful strategy, therefore, is that of 
maximum fragmentation: where possible, render a schema as a 
conjunction of shorter schemata and thus exploit the advantages 
of Method B to the full. Such fragmentation can often be facilitated 
by these two expedients: (a) Drop all distinctively lettered initial 
existential quantifiers, these being obviously immaterial to con- 
sistency; and (b) Distribute universal quantifiers into conjunc- 
tions, where possible, according to the equivalence of ‘(x)(Fr. 
Gamers. (2)Gz’. 

In the case of (5), e.g., we might have dropped the initial ‘(ay)’ 
and distributed the ‘(x)’, thus turning (5) into the conjunction of: 


(x)[Fxy D — (az) (F rz. F2x)], 
(x)(— lz)(Frz. Fzx) D Fry, 


which could thereupon be handled by Method B. 

After establishing in §§4-5 the soundness and completeness of 
Method A, I shall prove in §6 that Method B is equivalent to A 
and therefore sound and complete as well. 


4. Soundness of Method A means that if a prenex schema ¢ is 
consistent then its functional normal form has no truth-function- 
ally inconsistent conjunction of lexical instances. 

Let us think of ¢ as: 


(19) (ax) (y) (2) Au) (v2) Gw) &@, y, 2, U, », w); 


4 Substantially this case of the decision problem was first solved by Bernays 
and Schénfinkel. 
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this will typify the general case well enough. Now we are suppos- 
ing ¢ consistent, hence true under some interpretation & in some 
non-empty universe U; and what we want to show is that every 
conjunction of lexical instances of the functional normal form: 


(20) (y) (2)(v) B(x, y, Z, Uys, 0, Wyeo) 


is truth-functionally consistent. 

Let us adopt the interpretation 3, thus supposing (19) simply 
true. But (19) says there is a particular object, which [ shall here- 
after call x, such that 


(21) (y) (2) (Au) (v) Aw) B(a, y, 2, u, v, w). 


According to (21) there is, for each choice of y and z, an object— 
which let us call u,,—such that (v)(dw) ®(2z, y, 2, Uy, v, w). So 


(22) (y)(z)(v) (Aw) Ba, y, 2, Uye, ¥, W). 


According to (22) there is, for each choice of y, z, and v, an object— 
which let us call w,,.—such that ®(2, y, 2, Uyz, V, Wyer). So (20), 
under these provisions, becomes true and hence so do all its in- 
stances. But this would be impossible if a conjunction of these 
instances were truth-functionally inconsistent.® 


&. Completeness of Method A means that if a prenex schema ¢ is 
inconsistent, 1.e., if 

(i) @ is false under every interpretation in every non-empty 
universe, 


then the functional normal form y of ¢ has a truth-functionally 
inconsistent conjunction of lexical instances. 

Let &, &, ... be all the expressions obtainable by applying 
predicate letters of y to terms from the lexicon of w. Clearly all the 
lexical instances of y are truth functions of &, &, .... 

Let U be a universe in one-to-one correlation with the lexicon 
of y. If we construe the terms of the lexicon as naming their respec- 
tive correlates in U, then an interpretation of the predicate letters 
of ¥ (with U as universe) is fixed by any full assignment of truth 
values to &, &, ... . Moreover, any such interpretation makes 
y true if it makes all the lexical instances true; for, given that the 


5 Note added 1966: This sketchy argument depends on the axiom of choice, as 
Dreben has remarked. We can avoid that dependence if we wish, thanks to 
Léwenheim’s theorem, by taking U as the universe of positive integers. We 
can then specify u,z as the least u such that (v)(Aw)®(az, y, z, u, v, w); and simi- 
larly for wyz.. See next footnote. 
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lexicon names all the objects of U, the lexical instances exhaust 
the cases of y. But y in turn implies ¢; this becomes generally 
evident if we look back to the quasi-example (20) and observe that 
it implies (22), which implies (21), which implies (19). By (i), then, 


Gi) No assignment of truth values to &, &, ... makes all 
lexical instances of w true. 


Let us speak of a given assignment of truth values to &, ... , 
£; as condemning a given conjunction of lexical instances of wy if it 
makes that conjunction come out false for all values of &41, &:42, 
... . Then let ¢; for each z be one or other of the truth values, 
T or L, according to this inductive condition: 


Gii) ¢;is T if assignment of h, . . . , ¢:-1, and T respectively to 
é, ... , & condemns no conjunction of lexical instances of y; 
otherwise L. 


Thus é,, to begin with, is T if assignment of T to & condemns no 
conjunction of lexical instances; otherwise __. The stipulation (ii) 
fixes t; for each 2, or for each 2 up to the number of é’s, according as 
the é’s are infinite or finite in number. 

By (ii), there is a lexical instance x of » which comes out false 


when i, tf, . . . are assigned respectively to &, f, ... . Then, 
where j is high enough so that none of £41, £42, . . . occurs in x, 
it follows that the assignment of i, ... , ¢; respectively to &, 


... , & falsifies x (and hence, a fortiorz, condemns x). Then there 
is also a least h, whether 7 or less, such that the assignment of ¢,, 
...,t&to&, ... ,& condemns a conjunction of lexical instances 
of y. (Note that 4, . . . , ft, may condemn a conjunction of lexical 
instances even where hf is too small foré,, . . . , 4, to condemn any 
one lexical instance.) For that h, by (iii), t, must be L. So 


(iv) Assignment of ¢;, . . . , &1, L respectively to &, ...,& 
condemns some conjunction w of lexical instances of y. 
But also 


(v) Assignment of ti, . . . , éx-1, T respectively to &, ...,& 
condemns some conjunction w’ of lexical instances of y, 


since otherwise, by (iii), t, would be T. Therefore h = 1; for other- 
wise, by (iv) and (v), the assignment of 4, ...,t&4to&,..., 
£,1 would condemn ww’, contrary to the leastness of h. But then, 
taking h as 1 in (iv) and (v), we see that w and w’ are respectively 
condemned by assigning T and L to &, and hence that ‘ww’'is a 


PROOF FOR QUANTIFICATION THEORY 203 


truth-functionally inconsistent conjunction of lexical instances.® 


6. Reduction of B to A. The soundness and completeness of 
Method B will now be established by showing that B accomplishes 
all that Method A accomplishes and nothing more. 

Consider any conjunction ‘midi... . . Unon', Where pu; for each 
7 is a string of quantifiers and ¢; is without quantifiers. Suppose 
further that, in conformity with Method B, no variable occurs both 
free and in an existential quantifier, nor twice in existential quanti- 
fiers. Let 'v.;' be the functional normal form of "u.¢;', for each 7. 
Now what is to be shown is that there is a prenex ‘ud! equivalent 
to ‘mdi... .. Unda ', Whose functional normal form has an in- 
consistent conjunction of lexical instances if and only if ‘my... . 
-YnW,' has an inconsistent conjunction of lexical instances. The 
equivalence of Methods A and B then becomes evident. 

We may assume without loss of generality that the universal 
quantifiers in y1, . . . , Mn are lettered as we please; for the choice 
of letters in that quarter obviously has no effect on the lexicon and 
the lexical instances. Let us suppose then that in each of uw, ... , 
un the leftmost universal quantifier (if any) is uniformly "(a)", the 
next (if any) is '(a2)', and so on, where a, az, . . . are distinct 
from the existential variables and the free variables. 

By our constructions, any variable which is existentially quanti- 
fied in yp; is foreign to "u;¢;' for all 7 * t. Therefore "midi... . 
- Unga is logically transformable by bringing out all those existen- 
tial quantifiers of y;, for each 7, which are not preceded in yp; by 
(a1) |, and letting them govern the whole conjunction. Within the 
conjunction thus governed, each conjunctional component either 
begins with '(a,) ‘or has no occurrences of a;. But then, in view of 
the laws ‘(x)Fx . ()Ga .= (x)(Fx.Gzx)’ and ‘(2)Fx.p.= (x)(Frz 
. p)’, we can pull out '(a)' everywhere and let a single occurrence 
of '(a:)' govern the whole conjunction. Next, repeating the proc- 
ess, we can bring forward all those further existential quantifiers of 
ui, for each 7, which are not preceded in yp; by '(a2)'. Next we can 
pull out ' (a2) ' everywhere, in favor of a single occurrence of '(a:) ! 
governing the whole conjunction; and so on. We end up with a 
prenex equivalent ‘u(d: . . . da) Of "mdi. . 2. . Unda ', Where pu 
is composed thus: first come all existential quantifiers of u;, for all 2, 


6 The above argument is akin to Gédel’s completeness argument (1930) as 
extended by Dreben. See also next paper. The argument incidentally proves 
Léwenheim’s theorem, since U is clearly finite or denumerable. 
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which were not preceded in pu; by '(a1)"; then comes '(a;)"; then 
come all further existential quantifiers of u;, for all 7, which were 
not preceded in pu; by ' (a2) '; and so on. 

Next let us reflect on the functional normal form ‘vv; ! of "uid; |. 
Clearly v,, for each 2, will be "(a1) . . . (au;)' where u; is the num- 
ber of universal quantifiers in y;; and y; will differ from ¢; only in 
that each variable 8 of ¢; which had been existentially quantified 
in pu; is decorated in y,; with a string of subscripts "a1az, . . .' cor- 
responding to all those universal quantifiers "(ai)", "(az2)4... 
which preceded ' (38) ‘in pi. 

What now of the functional normal form of "u(¢i . . . on)? 
Obviously its string of quantifiers will be "(a;) . . . (a,)' where k 
is the largest of w, . . . , Un. What comes after the quantifiers will 
be formed from '¢, . . . ¢,! by inserting subscripts. Specifically, 
within any component ¢; of "$1 . . . ¢@n! the string of subscripts 
"aia. . . .' which is to be attached to a variable 8 will correspond 
to the universal quantifiers which preceded '(38)' in p. But the 
universal quantifiers preceding '(38)'in uv are, by construction, the 
same as those which preceded '(36)1in w;. Therefore what "¢, .. . 
gn ' gives way to is precisely "f . . . wn '; so the functional normal 
formmon md)... dn) 1S (ai)... fadGn.. . mye 

We have only to remind ourselves of the definition of ‘lexicon’ to 
observe that the lexicon of a functional normal form is uniquely 
determined once we are told what complex terms (i.e., letters with 
subscripts) that functional normal form contains, and what free 
variables without subscripts. Similarly for a conjunction of func- 


tional normal forms. But then ‘(ai) ... (az)(i. . . wa)! and 
es an Yntn | have the same lexicon. For, their complex terms 
are the same, viz., all complex terms of 'y, . . . ¥,'; and their free 


variables without subscripts are the same, viz., all those of "1... 
Yn! except ai, ... , Qh 


Hence each lexical instance of "(a1) ... (an)... Wn) 
being a result of substituting in'y, .. . ¥,'forai, ... , ax from 
that lexicon, is a conjunction of lexical instances x1, . . . , Xn Of 
i) ae vatv,'; and conversely each lexical instance x; of 
We <  s - YValn' is a conjunctional component x; of a lexical 
instance "xi... xn! of "(a1) . . . (an)... wn)" 

But it then follows that "(a,) ... (ax). . . Wa)’ will have 


an inconsistent conjunction of lexical instances if and only if 
We. ws Van | has, q.e.d. 


XIX 


«t Interpretations of 
Sets of Conditions 


The celebrated theorem of Léwenheim and Skolem tells us that 
every consistent set S of quantificational schemata (i.e., every set of 
well-formed formulae of the lower predicate calculus admitting of 
a true interpretation in some non-empty universe) admits of a true 
numerical interpretation (i.e., an interpretation of predicate letters 
such that all schemata of S come out true when the variables of 
quantification are construed as ranging over just the positive 
integers). 

Later literature goes farther, and shows how, given S, actually to 
produce a numerical interpretation which will fit S in case S is con- 
sistent. The general case is covered by Kleene.! The special case 
where S contains just one schema (or any finite number, since we 
can form their conjunction) had been dealt with by Hilbert and 
Bernays. Certain extensions, along lines not to be embarked on 
here, have been made by Kleene, Kreisel, Hasenjaeger, and Wang 
(1951). 

My present purpose is expository: to make the construction of 
the numerical interpretation, and the proof of its adequacy, more 
easily intelligible than they hitherto have been. The reasoning is 
mainly Kleene’s, though closer in some ways to earlier reasoning 
of Gédel. 


1. The General Numerical Interpretation. Our task here is to 


Reprinted from the Journal of Symbolic Logic, Volume 19, 1954. 
11952, 389-398; also 431, lines 22-26. 
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present a numerical interpretation of any consistent class S of 
quantificational schemata. We may think of the membership of S 
as making up an infinite progression of schemata, ordered in famil- 
iar lexicographic fashion. The case of a finite S can be accommo- 
dated by supposing repetition, say with arbitrary relettering of 
bound variables, from the last schema onward. (Also the finite case 
can, of course, be treated separately and more simply, as noted in 
$3.) We may without loss of generality suppose the schemata closed 
(i.e., devoid of free variables aside from predicate letters) and 
prenex (i.e., with all quantifiers initial) and wholly unlike one 
another in the choice of letters for the bound variables. Thus the 


members of S are "yidi', “uodo', . . . where wi, we, . . . are strings 
of quantifiers and qi, @2, . . . lack quantifiers and, for each m, the 
conjunction ‘yidi..... Umém' is equivalent to "mw... Un(dr 


. om)’. Let us suppose further that uw, begins with a universal 
quantifier; such a quantifier can always be trivially prefixed. 

We shall need the auxiliary notion of a substztution sequence for a 
string of quantifiers. To begin with, suppose positive integers, same 
or different, arbitrarily assigned to all the universal quantifiers in 
the string. Then to the jth existential quantifier in the string, for 
each 7, we assign (following Kleene) the number 2/-3%-5¢-. .. 
pr*t, where di, Q2, . . . , @, are the respective numbers assigned to 
the preceding universal quantifiers and p, is the rth prime number 
after 2. Thus for each string of quantifiers there are substitution 
sequences corresponding to all the ways of assigning numbers to 
the universal quantifiers; and there are infinitely many such ways, 
if a universal quantifier is present at all. So, for any string of 
quantifiers one at least of which is universal, we can speak of the 
first, second, . . . substitution sequence, according to a lexico- 
graphic ordering. 

Now let Wm”, for each m and n, be what we get from "¢; . . . dm! 
by substituting numerals for variables according to the nth sub- 
stitution sequence for "mw, . . . um'. (Since w: has a universal quan- 
tifier, Y.% exists for every n.) Thus y,,." for each m and n is a truth 
function of atomic formulae w, we, . . . each of which consists of 
a predicate letter in application to one or more numerals. 

Where ow, w:, . . . ad infinitum are, in a lexicographic ordering, 
all the formulae that can be formed by applying predicate letters 
to numerals, clearly any assignment of truth values to aw, ... , 
w, will either verify or falsify "Wm! . . . Ym"! if ris high enough so 
that w,, . . . , w, exhaust the atomic formulae in ‘Wn? 2. © Yn?! 


SETS OF CONDITIONS 207 


If on the other hand r is not that high, then an —— of truth 
values to w, ... , w, may still verify or falsify "Y,' ~ Vent! 
outright (for all values of the further atomic formulae in a 
Yn"'), or it may do neither (pending assignment of values to sae 
further atomic formulae in ‘Wp! . . . Ym"'). Let us call an assign- 
ment of truth values to aw, ... , w, pec if there are no m 
and n such that it falsifies "Yn! . . . Y."' outright. 

To specify a numerical interpretation of S, or {"yidi', “pode 

. }, itis necessary and sufficient to say what numbers or num- 

ber sequences, as arguments, are to verify and falsify the several 
predicate letters. In short, it is necessary and sufficient to give 
truth values to w1, we, .... We are now ready to specify, in 
that way, a numerical interpretation {&¥ which fits S if S is consist- 
ent. It is this: for each r, w, 1s to be true tf and only wf there 1s an 
innocuous assignment to a, . . . , wr and it, or the earliest such if 
there are several, assigns truth to w,. ‘Earliest’ here is to be under- 
stood in terms of the familiar ordering: 


PM Ue MT lee Vek T, T con. 1 Rall, 
cage Th ate inal: 


2. Proof that & Fits S if S Is Consistent. 

Lemma 1. If "ud' 7s a closed prenex consistent schema and y" for 
each n 1s the result of substituting numerals in @ according to the nth 
substitution sequence for pw, then, for each n, "y'.. . ¥*! ts truth- 
functionally consistent. 


Proof. Where » has k quantifiers, let x; for each 7 from 1 to k be 
formed from ‘u¢' by dropping the first 7 quantifiers and making 
substitutions for the recurrences of the variables of the dropped 
quantifiers according to the first substitution sequence for y; using, 
however, new and distinct variables a1, a2, . . . instead of ‘1’, ‘2’, 

. In like fashion let xz4:, xox4:, . . . be formed according to 
the second, third, . . . substitution sequence. Let yo be "ud". For 
each 7 2 0, therefore, x;+: is got from x; (or perhaps from x0) by 
dropping the first quantifier and putting a, (for some r) for the 
recurrences of the variable of that quantifier. There are three possi- 
ble Cases: (1) the dropped quantifier is universal, (2) it is existen- 


tial and a, is foreign to x1, . . . , x;, or (3) it is existential and a, 
occurs somewhere in x1, . . . , xj. In Case 1, "xo . . . x;!' implies 
Xj+1; in Case 2, ‘xo . . . xj! implies "(da,)(xo . . . x;41) |; and in 


Case 3, x;41 IS a mere repetition of one of x1, . . . , x; (by virtue 
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of the structure of substitution sequences). In all three cases, there- 
fore, the existential closure of "xo . . . x;' (formed by prefixing 
existential quantifiers for all free variables) implies that of "yo . . . 
x;+1 '. This happens for all 7. Hence xo (which is, vacuously, its own 


existential closure) implies the existential closure of "yo . . . xXnx!. 
Hence, if xo is consistent, "xo . . . xn! is truth-functionally con- 
sistent; then so is "xx... Xne'; and then so is 'y!. . . y*) 
this being the same thing with ‘1’, ‘2’, . . . inplaceofai,a., .... 


Hereafter (though not in the above lemma) the notation refers 
back to §1. 


Lemma 2. If m > 1, then for every j there 1s an n such that Pn 
amplies y;? truth-functionally. 


Proof. The assignment of numbers to universal quantifiers which 
is made in the jth substitution sequence for "yw, . . . w;! is made 
again in some substitution sequence, say the nth, for "mw... pm! 
Moreover the assignments to existential quantifiers of "uw... . 
u;' will be the same in both, since such assignments depend in no 
way ON wisi, - ~~. , Mm. Therefore Yn" will consist simply of y,7 in 


conjunction with some result of substitution on "¢j41 . . . dn! 
Lemma 3. For any h, i, j, and k there are m and n such that “Wm 

Um implies "Wi... Wakbd 2. © wt! truth-functionally. 
Proof. Take m ash +7. Then, by Lemma 2, each of y,/, .. . , 


Yak, Wil, . . . , Wei is implied by one or another of Wn, Yn?, . . . - 
Thus there is a set of 7 + k& or fewer formulas, from among y,,1, 
Wn?, ... , Such that their conjunction implies "y,! . . . wry; 
... Wi’. Then, where ¥,." is the last formula of such a set, "Wm! 
mee implies Wy,’ ..... Wye ee 

For purposes of the next lemma we adopt the following conven- 
tion. Where 2% is an assignment of truth values tow, . . . , wr, let 
us understand [+ as consisting in the assignment of 2% tow, ... , 
w, and truth to w,41; and let us understand %[- as consisting in the 
assignment of 2 to wi, . . . , w, and falsity to wri. 

Lemma 4. Jf % ts the earliest innocuous assignment to wi, ... , 
w,, then U+ or A- 2s the earliest innocuous assignment tow, ... ; 
Gyr+41- 

Proof. If neither %+ nor %- is innocuous, then + falsifies 
"Yi . . . wk and U- falsifies ') . . . Yi for some h, 2, 7, and k. 
But then 2 falsifies "y,! . . . varbd .. . v7! Then, by Lemma 3, 
YW falsifies "Wn! . . . Wa%' for some m and n. But this is impossible, 
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since %{ is innocuous. Therefore 2+ or Y- is innocuous. Moreover, 

by the definition of earliness of assignments, any assignment to w, 

.. . , Mp1 earlier than 2+ must be 8+ or B— for some B which is 

earlier than 2{ and hence not innocuous. But, by the definition of 

innocuousness, if 8 is not innocuous then neither is 6+ nor B-. 

Therefore 2+ or %{- is the earliest innocuous assignment to , 
° » Wr+l- 


From here on to the end of the section, all arguments are to be 
understood as subject to the 


HyYporuHEsis. S 7s consistent. 


Lemma 5. For each m and n, "Wm... Wm"™' is truth-functionally 
consistent. 

Proof. For any m, by Hypothesis, "ui... um(di . . . dm)! i8 
consistent. But this formula is related to 'y,! . . . Wat! asis "ud! 
in Lemma 1 to'Wi.. . yr! 

Lemma 6. For each r there 1s an innocuous assignment to wi, . .. , 
We. 


Proof. Suppose neither the assignment of truth nor the assign- 
ment of falsity to a, is innocuous. For some h, 7, 7, and k, then, 
assignment of truth to «, falsifies "y,! . . . ¥,*! and assignment of 
falsity to ow falsifies "yt... 7. Then "yy! ... wtpt.. . 
v7" is truth-functionally inconsistent. Then, by Lemma 3, 'y,,! 
.. » Vm" is truth-functionally inconsistent for some m and n. But 
this contradicts Lemma 5. Therefore Lemma 6 holds when r is 1. 
From Lemma 4, then, it follows by mathematical induction that 
Lemma 6 holds for all r. 


LemMa 7. & makes Wm" true for all m and n. 


Proof. Let k be the highest number such that w; occurs 1n Wn". By 
Lemma 6 and the definition of S, $ agrees on w, (for each r) with 
the earliest innocuous assignment to a, ... , w,. By Lemma 4, 
then, {¥ agrees on each of w, . . . , wz with the earliest innocuous 
assignment to a1, ... , w;. But that assignment makes y,," true. 


Tuesis. For each m, &$ makes ‘tmdm' true. 


fomco "py... Hm’ has the form "(a;) .. 4% (@a)GBi) 4. . 
(3Bs)(v1) . . - (ve)(451) . . . (Ada) . . .. Hence truth of "uw... 
Lm(oi . . . dm) ' (in the universe of positive integers) amounts, by 


the meaning of quantification, to this: for each choice of numerical 
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substitutions for a1, . . . , aa there are numerical substitutions for 
Bi, . . . , B such that for each choice of numerical substitutions 
for7yi, ... , Ye there are (and so on) such that "¢; . . . ¢m! comes 
out true. But, by the definition of substitution sequence, some 
substitution sequence for "ui... um! realizes any desired sub- 
stitutions on a, . . . , a followed by some on f;, . . . , By fol- 
lowed by any whatever on 1, ... , Yc followed by, etc. So “wy 

. bm(bi . . . om)’ will be true if “¢: . . . dn! comes out true 
under substitution according to every substitution sequence for 
Tur... dm; hence if Wm!, Wn, . . . areall true. But, by Lemma 7, 
& makes Wm, Wm?, . . . all true. Therefore $ makes "m .. . pm 
(¢1 . . . om)! true. Therefore § makes "undm! true. 


83. The Finite Case. Completeness. If we are interested in the case 
not of an infinite S but merely of a single schema ‘y¢' (or any finite 
number, since we can form their conjunction), then we can greatly 
simplify §$1-2 as follows. We drop all subscripts from ‘y’, ‘¢’, and 
‘y’, thus speaking simply of u, ¢, and y” instead of "mw. . . um 
"od... . bm, and Wm™. The Hypothesis and Thesis become: 


"ud 'is consistent, & makes "u¢' true, 
and Lemmas 5 and 7 become: 
Lema 5’. Foreachn,"y!.. . y*'is truth-functionally consistent, 
Lemma 7’. § makes ¥* true for all n. 


Similar editing takes place uniformly in the proofs of Lemmas 4, 
6, and 7’ and of the Thesis. Moreover, we can now drop the cita- 
tion of Lemma 3 in the proofs of Lemmas 4 and 6, simply taking n 
at those points as the larger of 7 and k; so Lemmas 2-3 can be 
omitted altogether. The proof of Lemma 5’ reduces to mere 
citation of Lemma 1. Lemma 1 itself, and its proof, remain un- 
changed. 

The simplified line of reasoning which is thus arrived at can be 
slightly varied in turn to give a proof of the completeness and 
soundness of a system of quantification theory. For this purpose 
we drop the hypothesis that "ud! is consistent, and let Lemma 5’ 
itself stand as hypothesis. Now the conditional whose hypothesis 
is Lemma 5’, and whose thesis is that & makes "yd! true, im- 
plies the converse of Lemma 1. So our proofs, thus rearranged, 
comprise in effect simply a proof of Lemma 1 and its converse; 
a proof, in short, that "ud! zs consistent if and only if "Ys... yr" 
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ts truth-functionally consistent for each n. This establishes the 
completeness and soundness of a system of quantification theory 
in which a proof of "~u¢' consists simply in citing a number n for 
which 'y! . . . ¥*'is inconsistent by truth tables. 

The notion of a substitution sequence can be simplified, for pur- 
poses of this system, to the point of practicability: we can number 
the existential variables in the economical way in which Dreben 
numbered his universal ones, instead of bringing in Kleene’s big 
numbers (which, however, were needed in §$1-—2, in order to get 
Lemma 2). The whole theory determined by this present §3 is 
implicit in Dreben’s paper, but the argument here indicated is 
simpler. 


XX 


at Church’s Theorem on the 
Decision Problem 


1. Program. Church showed in 1936 that there can be no de- 
cision procedure for quantification theory.! My purpose in this 
expository paper is to present a more perspicuous and self-con- 
tained account of this important proof than has thus far been 
available in print. The argument is substantially that which I 
have used in the classroom from 1949 onward. 

The theorem, roughly stated, is that 


(1) The class of valid quantzficational schemata is not effective; 


i.e., that there is no general mechanical method for deciding mem- 
bership in that class. Now such a result is demonstrable only inso- 
far as the impressionistic idea of “effectiveness” is made sharp 
and explicit. The sharp concept to this purpose is that of recursive- 
ness (§2), in terms of which Church’s theorem may be stated thus 
strictly: 


(II) The class of Gédel numbers (cf. §2) of valid quantificational 
schemata 1s not recursive. 


However, (II) mainly interests us only as evidence for (I), and 
hence only insofar as we are persuaded of this ancillary ‘“Thesis”’ :? 


This paper, mimeographed in 1954 for the use of my pupils, appears in print 
for the first time. 

1Turing independently proved the same. See also Hilbert and Bernays, 
Vol. 2, 416-421. 

2So called by Kleene. 
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(III) All effective classes of numbers are recursive. 
Now it turns out that by use of (III) and its dyadic analogue: 


(IV) All effective relations of numbers are recursive 


we can get to (I) with much less labor than it takes to prove (IT) 
without (III)-(IV). Such is the plan of this paper. But let me not 
thereby seem to belittle the important fact that (II) can be proved 
without (III)-(IV). 

Recursiveness will be defined and examined in §2, less for 
reasons of unfamiliarity than in order to have certain details ex- 
plicitly before us for reference. In §3 a small theory of functions 
will be developed; it is needed in order to forge a link between 
substitution operations and quantification theory, a link which is 
essential to the proof of (I) or (II). Finally, in §4, (1) will be 
proved with help of (III) and (IV). Meanwhile the intuitive term 
“effective” and the technical term ‘‘recursive’”’ will be kept apart, 
no use of (III) or (IV) being made till the very end. 


2. Recursiveness. An equation will be called rudimentary if 
built up of just these materials: ‘0’, ‘S’ (for successor), ‘=’ (for 
identity of numbers), variables ‘z’, ‘y’, . . . (for positive integers 
and 0), variables ‘f’, ‘g’, . . . (for numerical functions), and 


parentheses (to set off complex arguments of functions). Examples: 


0=0, O=S0, f(gr)(fyx) = g(S(f20)). 

Each of ‘0’, ‘SO’, ‘SSO’, etc., will be called a nwmeral. Where p is 
a function variable and yp, w1, . . . , a, are numerals, the equation 
"ou... - Mn = »' will be called an evaluation of p. Specifically it 
will be called an evaluation of p for 41, ... , Zn aS arguments, 
where 71, ..., %, are the numbers named by mw, ..., mn. 
Example: ‘f(S0)(SS0) = SSSO’ is an evaluation of ‘f’ for 1 and 2 
as arguments. An evaluation will be called positive when, as in 
this example, the right side is not ‘0’; otherwise it will be called a 
zero evaluation. 

By a recursive derivation of an equation ¥ from a conjunction ¢ 
of equations is meant a conjunction of rudimentary equations be- 
ginning with ¢ and ending with y and such that each of the equa- 
tions after ¢ is got, from equations further left, by one or the 
other of these operations: (A) substituting numerals for variables, 
and (B) putting one side of an equation for the other in an equa- 
tion. When such a recursive derivation exists, y is called recursively 
derwable from ¢. A conjunction ¢ is called a recursion if just one 
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evaluation of each function letter of ¢, for each choice of numbers 
as arguments, is recursively derivable from ¢. 

A class of numbers is called recursive if there is a recursion ¢ such 
that zero evaluations of ‘f’ are recursively derivable from @¢ for 
all and only those arguments belonging to the class. (The singling 
out of the letter ‘f’ for this key role is of course arbitrary and in- 
essential.) A dyadic relation of numbers is called recursive under 
parallel circumstances; simply reread ‘‘arguments” now as ‘‘pairs 
of arguments.’ 

To expressions built up of an alphabet of nine or fewer signs, we 
may conveniently assign Gédel numbers as follows: we assign the 
numbers 1, 2, etc., to the individual signs, and then to any com- 
pound expression we assign the number whose arabic numeral is 
got by coneatenating the digits ‘1’, ‘2’, etc., to match the concat- 
enation of signs in &. Alphabets in excess of nine signs can artifi- 
cially be reduced to nine for these purposes. E.g., the alphabet 
used in recursions and recursive derivations may, for purposes of 
Gédel numbering, be conceived as comprising just ‘0’, ‘S’, ‘f, 
‘ry’, ‘=’, the two parentheses, the dot of conjunction, and an 
accent ‘”; for we may treat ‘g’, ‘h’, etc., as ‘f”, ‘f’”, etc., and ‘y’, 
‘2’, etc., as ‘x’, ‘x’”, ete. 

Consider now the relation K of any number z to any number y 
such that y is the Gédel number of a recursive derivation, from a 
conjunction whose Gédel number is z, of a positive evaluation of 
‘f for x as argument. Clearly K is effective—i.c., effectively de- 
cidable for any specific numbers x and y given by numerals. Yet 
the domain of K is not recursive. For suppose it were. Then, 
according to the definition of recursiveness, there would be a 
recursion ¢ such that zero evaluations of ‘f? would be derivable 
from ¢ for all and only those arguments x belonging to the domain 
of K. But this situation is seen to be impossible when x is taken 
in particular as the Gédel number of ¢; for, by the definition of K, 
x belongs to the domain of K if and only if a posztive evaluation 
of ‘f’, for x as argument, is derivable from a conjunction whose 
Gédel number is x.4 


8. Schematic Function Theory. A theory will now be set forth 
which uses ‘2’, ‘y’, etc., as quantifiable variables and ‘f’, ‘g’, etc., as 


3’ The terminology and definitions of the past two paragraphs are adapted 
from Kleene, 1943, 43-45. 
4] have adapted K and the attendant argument from Kleene, 1943, 48-49. 
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function letters not subject to quantification. Because unquantified, 
these latter letters are best viewed as schematic, and the formulas 
of the theory as schemata, valid if true for all functions as inter- 
pretations. The universe of quantification may, but need not, be 
taken as comprised of numbers. The functions appropriate as 
interpretations of the function letters are, at any rate, any func- 
tions having values in that universe for all arguments in that uni- 
verse. Each function letter is to be thought of as n-ary (i.e., as 
taking arguments n at a time) for some n2 0, varying only from 
letter to letter; where n is 0 the letter behaves as a schematic 
constant, or parameter. 

The class of terms is explained inductively thus: quantifiable 
variables are terms of order 0; and an n-ary function letter fol- 
lowed by n terms, some of order 7 and none higher, is a term of 
order 7 + 1. A nullary function letter counts as of order 1. 

ing describing the formulas of this theory let us use ‘a’, ‘8’, and 

y’ to refer to quantifiable variables ‘x’, ‘y’, etc.; let us use ‘p’ to 
at: to ‘f’, ‘g’, etc.; and let us use ‘¢’, ‘n’ aan ‘8’ to fait to any terms. 

Equations "t = 7' will be alee required to have a term 7 of 
positive order, rather than just a variable a, as right side;> thus 
every equation has the form '¢ = pm---m,', where n = 0. The 
shown occurrences of {, m, +--+, tm here will be called the principal 
occurrences of terms in the equation. 

An equation will be called a primitive equation if it has the form 
"a = p6i--:8,'; in other words, if all principal occurrences in it 
are of order 0. The atomic formulas of our theory are to be thought 
of as such equations exclusively; and the formulas generally (pend- 
ing supplementation by an imminent abbreviative definition) are 
just what can be built from those primitive equations by truth 
functions and quantification. Thus other equations, and terms of 
order above 1, have no place in our initial notation. But they are 
brought in by the following contextual 


Definition. If py is an equation with just k principal occurrences of 
terms of positive order, and these are occurrences respectively of £1, 


. , &% then ts short for "(da) . . . Gaz)(ar = & .a2 = £2. 

= (.¢)', where a, . .. , ax are new and distinct and $ 

as like w except for having a, ... , az in place of the respective 
principal occurrences of [1, .. . , Sk. 


6 This restriction simplifies the theory somewhat, without significantly im- 


peace it. Equations of the excluded form & = ri can always be rendered 


a = ¢', barring only the useless species "8 = 
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Any non-primitive equation can be expanded into primitive 
notation by repeated application of the above definition. Each 
application turns an equation y into a formula in which no equation 
contains principal occurrences of terms of as high order as y did. 

The theory is to comprise two classes of axioms. The first con- 
sists of all formulas '(da)(@ = ¢)' such that ¢ is a term of order 1 
lacking a. The second consists of all formulas 'a = ¢.B = ¢.D. 
ga = og'such that ¢ is of order 1 and ¢, and gz are primitive equa- 
tions which are alike except that ¢. has a in one or more places 
where ¢¢ has 8. 

The theorems comprise all the consequences of the axioms 
[better: of their universal closures] by quantification theory. No 
use is made of identity theory, beyond what the axioms themselves 
provide. We turn now to seven metatheorems. 


M1. If «and B are numerical variables, £ 1s a term of order 1 lack- 
ing a, da 18 a primitive equation, and $8 18 da with B for all a, then 
"8 = ¢._). bg = (da)(a = ¢. da) '%8 « theorem. 


Proof. The formula is implied by an axiom of the second class, 
viz., "(a)(vy)(a = f.y = £.D. ba = $,) 'where y is a new variable. 
The form of the implication is: 


Qa: . fy .>.Gr = Gy) Dwr2 Ge = Gaia nee 


which, being a monadic quantificational schema, can be checked 
for validity in known ways. 


M2. If ¢ ts of order 1 and dg and ¢y are equations which are alike 
except that og has a principal occurrence of 8 where dr has one of §, 
then "B = ¢._. og = &'ts a theorem. 


Example: y = fxz .D:9x = hy(fww) .=. gz = h(fxz)(fww). By 
our Definition, this expands into: 


y = frz .D. (au) @v)(u = gx .v = fow.u = hyv) = 
(Ju) (32) (4v)(u = gx. t = faz.v = fuw.u = hr). 


But the formula, thus expanded, is clearly a consequence (by 
quantification theory) of this case of M1: 


y = frxz.Diu = hy .= (dC = frz.u = hiv). 


Any other example of M2 is similarly provable. 
M8. Lf ¢ is of order 1 and Wg and yy are formulae which are alike 
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except that yg has some free occurrences of 8 where ¥; has free occur- 
rences of ¢,° then "B = £.D. Ws = | ts a theorem. 


Proof. Case 1: Only one occurrence of 8 in Wg gives way to ¢ in 
yx. Let os be the equation within Ys which contains that occurrence 
of 8. Correspondingly for ¢r. 

Subcase la: The occurrence of 8 is principal in ¢s. Then "8 = 
¢t... os = ¢' is a theorem by M2, and 'B = ¢.D. Ws = y' fol- 
lows in view of the principle, in quantification theory, of the sub- 


stitutivity of the biconditional.’ 
Subcase 1b: The occurrence is not principal in ¢s. Then let 7. be 


the term which immediately contains that occurrence of 8; and let 
nm be the corresponding part of ¢;. Expansion of ¢s, by repeated 
application of the Definition, eventually gets this occurrence of 
ng over into the context "a = ng'for some variable a; and parallel 
expansion of ¢ gets the occurrence of 7 over into the context 
"a = nr’. But the occurrence of Bin ‘a = ng 'is principal, so we are 
back in Subcase la. 


Case 2: Many occurrences of 8 in Wg give way to ¢ in yy. Then 
there is a chain of formulas, the first of which is Ys and the last of 
which is ¥;, such that each is turned into the next by putting ¢ once 
for 8;so "8B = ¢.D. Wg = y' is deducible from a series of instances 
of Case 1. 


M4. If pts like Wa except for having free £ 1n place of all free occur- 
rences of a, then "(a)Wa > Y' ts a theorem. 


Proof. Case where ¢ is of order 0: Here "(a)2 D ¥ 'holds by pure 
quantification theory. 

Case where ¢ is of order 1: Taking 8 as a new variable, we have 
B=.) i (alle D Wp. =. (aa D %' as a case of M3. But the 
part "(a)We > vp' holds by pure quantification theory; so we are 
left with "8 = £.D. (a)¥a D ¥', and hence "(48)(8 = £) D. (aa 
> y ' But '(38)(8 = ¢)' can be dropped, being an axiom. 

Case where ¢ is of order 2: Here an example will suffice to show 
the reasoning. Let us take ¢ as ‘f(gyv)w’, and represent y. as 


ion? ..%. Suppose ‘’ isoionreigmto‘.....aw...’. By the 
preceding Case (¢ of order 1), these are theorems: 
(1) Qo. a. .Dme. Sew... 


6 A free occurrence of ¢ in y¥; is one within which no occurrences of variables 
are bound in yy. Cf. Mathematical Logic, 142f. 
7 1G@h, *121. 
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(2) wy. . oo...) D.C 
ZYC. ..@ es) Deh. . (Ge ae 


Hence so is: 
(3) @...8@...)D...4iGo...... 


Case where ¢ is of order 3, say ‘f(gy(hzw))w’: Proof similar, start- 
ing with (3) instead of (1). 

Any higher order can be reached by continuing the process. 

M5. If £ ts of positive order and Wg and yx are alike except that Wg 
has some free occurrences of B where Yr has free occurrences of £, then 
"8B =¢.D.e = | is a theorem. 

Proof. ¢ has the form'pm .. . m' Let yi, ... , vx be new and 


distinct, and let @; for each 7 from 0 tok be “py, . . . nryign . . . 
vy.'. By M4 we have 


(ys) (8 = 03-1 nD | Ys = = Vo;-s) Bs B = 0; oS. ve Wo; ' 
as a theorem for each 7 from 1 to k. But we also have '8 = @._. 
ve =v, by M3. So we get "8 = &.D. vs = %,', which is 
"B= 6.D.ve =". 

M6. If nts of positive order and y, and y, are alike except that y; has 
some free occurrences of ¢ where y, has free occurrences of n, then 
=n... = v,'ts a theorem. 

Proof. Let 8 be new. Then '8 = n._. wp = y,'1i8 a theorem by 
M5, and 

"BB =7.D.¥e =) Dif =1.0.% =H ' 
is a theorem by M4; so '¢ = ».D.% = y,' follows. 

M7. If py ts like. except for having free ¢ in place of all free occur- 
rences of a, then "y > (Aa)W.' its a theorem. 

Proof from '(a) —P. > —y', which is a theorem by M4. 


4. Church’s Theorem. I shall next establish the following 


Lemma. If the class of valid quantificational schemata is effective 
then every recursive dyadic relation has an effective domain. 


Let F be a recursive dyadic relation. Then, by the definition in 
§2, there is a recursion ¢ such that any numbers m and n stand in R 
if and only if a zero evaluation of ‘f’, for m and n as arguments, is 
recursively derivable from ¢. 
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In §3 a schematic function theory was developed in which noth- 
ing was stipulated concerning the number of function letters. Let us 
now fix those function letters as comprising just a binary ‘f’, a 
singulary ‘8S’, a nullary ‘0’, and whatever of ‘g’, ‘h’, etc., there may 
be in ¢. The two axiom classes specified in §3 then come to be finite 
(if we omit, as redundant, those variant axioms which differ only in 
the alphabetical choice of ‘z’, ‘y’, etc.). Let y be the conjunction of 
this finite stock of axioms. 

The Lemma will now be proved by showing how, for any number 
m, a formula is effectively specifiable which is quantzficationally 
valid (i.e., which has the form of a valid schema of quantification 
theory, with primitive equations in the role of atomic components) 
uf and only if mis in the domain of R. The formula in question is the 
expansion, by the Definition in §3, of '¢¥ D (ay) (fuy = 0) ' where 
nis ‘0’ preceded by m occurrences of ‘8’. 

For suppose the expansion of '¢y > (ay)(fuy = 0)'is quantifi- 
cationally valid. Then, since ¢ is a recursion and therefore true for 
certain interpretations of ‘f’, ‘g’, etc., and y is true for all, we must 
eonclude that ‘(ay)(fuy = 0)! is true for the interpretation of ‘f’ 
which fulfills ¢, and hence that m belongs to the domain of R. 

Conversely, if m belongs to the domain of R then 'fuy = 0", for 
some numeral y, is recursively derivable from ¢; i.e., derivable by 
(A) and (B) of §2 from ¢. But M6 and M4 show that whatever can 
be got from ¢ by (A) and (B) can be got from '¢y" by pure quanti- 
fication theory, together with the Definition in §3; and, once 
"fuy = O'is got, M7 shows that '(ay)(fuy = 0)' can be deduced in 
turn. Hence ‘gy > (3y)(fuy = 0)", expanded by the Definition, 
must be quantificationally valid. 

From the Lemma, finally, we argue to (I) of §1 as follows. In §2 
we noted an effective relation K whose domain was not recursive. 
It follows by (IV) and (III) of §1 that K is a recursive relation 
whose domain is not effective. In view of the Lemma, then, (I) 
holds. 


XXII 


# Quantification and 
the Empty Domain 


Quantification theory, or the first-order predicate calculus, is 
ordinarily so formulated as to provide as theorems all and only 
those formulas which come out true under all interpretations in all 
non-empty domains. There are two strong reasons for thus leaving 
aside the empty domain. 

(i) Where D is any non-empty domain, any quantificational 
formula which comes out true under all interpretations in all do- 
mains larger than D will come out true also under all interpreta- 
tions in D.! Thus, though any small domain has a certain triviality, 
all but one of them, the empty domain, can be included without 
cost. To include the empty one, on the other hand, would mean 
surrendering some formulas which are valid everywhere else and 
thus generally useful. 

(ii) An easy supplementary test enables us anyway, when we 
please, to decide whether a formula holds for the empty domain. 
We have only to mark the universal quantifications as true and the 
existential ones as false, and apply truth-table considerations. 

Incidentally, the existence of that supplementary test shows that 
there is no difficulty in framing an inclusive quantification theory 
(i.e., inclusive of the empty domain) if we so desire. A proof in this 
theory can be made to consist simply of a proof in the exclusive (or 
usual) theory followed by a check by the method of (ii). We may, 
however, be curious to see a more direct or autonomous formula- 


Reprinted from the Journal of Symbolic Logic, Volume 19, 1954. 
1 Cf. Hilbert and Ackermann, 1938, 92. 
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tion: one which does not consist, like the above, of the exclusive 
theory plus a rule of expurgation. And, in fact, such formulations 
have of late been forthcoming: Mostowski, Hailperin (1953), and, 
as part of a broader context, Church.? I shall not presuppose 
acquaintance with these papers, except in my final paragraph (and 
then only with Hailperin’s). 

Quantification theory may or may not be so fashioned as to 
accord significance to vacuous quantification, i.e., to the attach- 
ment of a quantifier to a formula lacking free occurrences of the 
variable of the quantifier. The three versions cited above all do 
recognize vacuous quantification. Now admission of the empty do- 
main raises a question regarding the vacuous universal quantifica- 
tion ‘(x)f’ of a falsehood ‘f’: Should we regard this quantification as 
false for the empty domain (on the ground that a vacuous quantifier 
is always simply redundant and omissible), or as true for the empty 
domain (on the ground that all universal quantifications are true 
for the empty domain)? Hailperin points out that Mostowski 
implicitly elected the first alternative, but that a more elegant sys- 
tem can be obtained by electing rather the second. 

Hailperin is right, except that he has understated his case. If a 
general semantical characterization of the truth conditions of 
quantification over a domain is phrased along natural lines, without 
special mention of vacuousness of quantification or emptiness of 
domain, the verdict of truth of ‘(x)f’ for the empty domain is pretty 
sure to follow. Moreover, this verdict is mandatory if, in general, 
the vacuous quantification ‘(x)p’ is equated to ‘(x)(p. Fx D Fx)’; 
and we must so equate it if we are to preserve extensionality, since 
‘9 =.p.Fx > Fz’ is tautologous. 

Hailperin recognizes a close kinship of his system of inclusive 
quantification theory to Church’s. But in developing his system 
and establishing its properties he refers in detail to a system of 
exclusive quantification theory in my Mathematical Logic. Now this 
exclusive system runs, in the revised edition, as follows: 


(1) If ¢ is tautologous (by truth tables), K¢. 
(2) LT(a)(oD ¥) D. (e)¢ D (@)y". 

(3) If a is not free in ¢, F"¢ D (a)¢". 

(4) LE '(a)d D of. 

(5) If '¢ > w'and ¢ are theorems, so is y. 


2“A formulation of the logic of sense and denotation,”’ 17f. 
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Here ¢£ is understood as like ¢ except for containing free 6 in place 
of all free a; and ‘+ ¢’ means that the closure of ¢ is a theorem. Free 
variables are not allowed in theorems. 

In the first edition a further principle was included, viz.: 


(6) L'(a)(B)¢ D (6)(a)o". 


However, Berry showed that, by modifying an arbitrary detail in 
the underlying definition of closure, we can dispense with (6). This 
simplification, which I adopted in the revised edition, is a particu- 
larly striking one when we reflect that (6) was the only polyadic 
axiom form in the lot. 

Taking the system of my first edition as point of departure, 
hence (1)-(6), Hailperin builds his system of inclusive quantifica- 
tion theory from it by changing (4) to: 


(7) LT (a)d D (B)¢F" 
and adding: 
(8) If a is not free in ¢, LE"? D (a) .D (a)(@D Wp)". 


Now the main point which I want to make is that Hailperin’s 
inclusive theory, simpler though it is than Mostowski’s, can be 
much simplified in turn. We can drop (6) by Berry’s expedient. Also 
we can drop (8) if, instead of changing (4) to (7), we merely weaken 
(4) by prefixing the hypothesis that a is free in ¢. 

The resulting system of inclusive quantification theory is simply 
the exclusive system (1)-(5) with the words ‘‘If @ is free in ¢” 
inserted in (4). This insertion is wanted so as to except ‘(x)f > f’, 
which fails for the empty domain. 

The completeness of this system of inclusive quantification 
theory can be seen by substantially Hailperin’s proof, which is 
easily adaptable to this system. Another fact which Hailperin 
establishes about his system, viz., that it can be strengthened to 
exclusive quantification theory by adding ‘+ '~(a)f” (or say 
‘ELT ~(a)(¢.~¢)”), is also demonstrable for our simpler system, 
again by substantially Hailperin’s proof. 

The required adaptation of Hailperin’s proofs is twofold, involv- 
ing not only a switch from his system to our new one, but a switch 
also from the first edition of Mathematical Logic, which he cites for 
certain proof patterns, to the revised edition. In detail the changes 
are as follows. In his proofs of his Lemma 1 and Theorem 1, change 
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**104” to ‘'*103.”’ (This change merely reflects a change of num- 
bering in the revised edition.) In his proof of Theorem 1, drop the 
proof of Case 1 in favor of a mere citation of our weakened (4) 
above. In his proof of Lemma 3, change ‘'*112” to “‘*115” and cite 
our weakened (4) instead of QE1. (These changes in the proof of 
Lemma 38 reflect both a change of numbering and a radical change 
of proofs in the revised edition.) In his proof of Theorem 2, read 
“QEO-6” as excluding QE] and QE4—5 and as including the weak- 
ened (4). 


X XIT 


at Reduction to a 
Dyadic Predicate 


Consider any interpreted theory ©, formulated in the notation of 
quantification theory (or lower predicate calculus) with interpreted 
predicate letters. It will be proved that © is translatable into a 
theory, likewise formulated in the notation of quantification 
theory, in which there is only one predicate letter, and it a dyadic 
one.! 

Let us assume a fragment of set theory, adequate to assure the 
existence, for all x and y without regard to logical type, of the set 
{x, y} whose members are x and y, and to assure the distinctness of 
x from {x, y} and {{xz}}. ({x} is explained as {z, x}.) Let us con- 
strue the ordered pair x;y in Kuratowski’s fashion, viz., as { {2}, 
{x, y}}, and then construe x;y;z as x;(y;z), and 2;y;2;w as x;(y;z;w), 
and so on. Let us refer to w, w,;w, w;w,w, etc., as lw, 2w, 3w, ete. 

Suppose the predicates of © are ‘Fy’, ‘F.’, .. . , finite or in- 
finite in number, and respectively di-adic, de-adic, .... Now 
let ©’ be a theory whose notation consists of that of quantification 
theory with just the single dyadic predicate ‘F’, interpreted thus: 


Reprinted from the Journal of Symbolic Logic, Volume 19, 1954. 

1 This result is reminiscent of, but distinct from, two others in the literature. 
One is Léwenheim’s and Kalmdar’s, which has to do not with the translata- 
bility of an interpreted theory, but with the likeness of quantificational 
schemata in point of consistency or inconsistency. The other is that of Church, 
Craig, and Quine, which assures reducibility not merely to a dyadic predicate 
but to a symmetric one; it differs from the present result in supposing the uni- 
verse to consist of natural numbers. See Kalmdr, 1936; Church and Quine; 
Craig and Quine. 
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G1) Fay =: Ge)(x = {y,z}) V.2 =y. Gn)(w)... 
(Sepa,) (Fad: . . . Way. © = aWytoyWyte-3; . . . Adz). 
The universe of ©’ is to comprise all objects of the universe of © 
and, in addition, {x, y} for every x and y in the universe of 0’. (Of 
course the universe of © may happen already to comprise all this.) 
Now I shall show how the familiar notations ‘7 = y’, ‘« = {y,z}’, 
etc., and ultimately the desired ‘Fyw, . . . wa,’, ‘Fou . . . wa,’, 
etc., themselves can all be defined within 0’. The arrow ‘—’ is used 
as a sign of definition. 


DEFINITION. & = y — (2)(Fer = Fzy). 


Justification. Obviously (z)(Fzx = Fzy) where x is y. So it will 
suffice to establish the converse: to assume that (z)(Fzr = Fzy), 
and show that x = y in the ordinary sense of ‘=’. Since {x} is 
{x, x}, clearly (4z)({x} = {2, z}) and hence, by (1), F{x}z. Then, 
by our assumption, F{x}y. Then, by (1), either (az)({x} = {y, z}) 
(whereat x = y) or else {xz} = y. The same argument with ‘x’ and 
‘y’ switched shows that x = y or {y} = x. Combining, 

EPpy ae ey 
Hence x is either y or {{z}}. But x cannot be { {z}}. 


DEFINITION.2 « = {y, 2} > (w)\(w #2. Frw.=:w=y.V. 
Ww 
Justification. We see from (1) that 
wex.Few.= (u)(e = {w, u}). 
But clearly 
x= ty, 2} .= (w)[@u)(z = {w, u}) 


DEFINITIONS. zx = y;z— (Ju)(aw) (x 


w= y.V.w =2i. 


{u,w}.u= ly, y}. 
w= \Y, z}), 

2 = y,z;w > Au) = yu. u = z;w), 

x= y;2z;wju— Av)(2 = yjv.v = 2;w;u), 


and so on. Further 


y=lw—y=u, 
y= 2w- y = u;u, 


and so on. Justifications obvious. 


* The preceding definition of ‘xr = y’ applies only to simple variables, and 
thus does not overlap the present definition or succeeding ones. 
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DEFINITION (for each fixed n and d,). 


Paw... . Wa, — (3) (3y) (Fre. y = nw,. 
L = Y;W1;Wj;WejW3; . . . jWa,). 


Justification. The definiens here amounts to: 


(2) F(nwywi;wiwe;ws; . . . 7Wa,) (NW WW; We;Ws; 6 6. 7Wa,), 
and we want to show it equivalent to ‘Frzw ... wa,’. By (1), 
since x ¥ {z, 2}, 


Fee = (3m)(Qm) . . . (tan) (Pint... Wan» 
C= MU Uj Ue; . . . ae 


Taking x here as nw,;w;wi;We;W3; . . . ;Wa,, We infer that (2) is 
equivalent to: 
(8) (m)Gu)... Gua)(Fnt . . . aw 

NW; W;W1We;W3; . 6. {Wd = MUjUsjUs;U2jUs; . ~~ fUdn)- 


Recalling our convention of parentheses, we see that the equation 
contained in (8) equates certain ordered triples, thus: 


Mow ECD . . . 0s,) = mean; . - . wees 


mu, and w, = uw, and hence also 


and implies therefore that nw, 
that m = n. So (3) reduces to: 


Gis). . . (Gta) oam . . . uae. 
NW1;Wr;Wij;We;W3; . . . SWan = MUU jUesUay 6. fda), 


and hence to: 


(32) Beas (Jug,) (Pu oo « Sey , =, .. We = VS. we 


and hence to ‘Fw, . . . Wa,’ 
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at Variables Explained Away 


As z increases, we are told, 2/x decreases. Since numbers never in- 
crease or decrease, such talk of variables must be taken meta- 
phorically. The meaning of this example is of course simply the 
general statement that if z > y then 2/xz < 2/y. Indeed logicians 
and mathematicians nowadays use the word ‘variable’ mostly 
without regard to its etymological metaphor; they apply the word 
merely to the essentially pronominal letters ‘zx’, ‘y’, etc., such as 
are used in making general statements and existence statements 
about numbers. A characteristic use of such letters is seen in the 
generality prefix ‘every number z is such that’, followed by some 
sentence, usually of the conditional or ‘if-then’ form, containing 
the letter ‘x’. Another characteristic use of such letters is seen in 
the existence prefix ‘some (at least one) number z is such that’. 

The familiar form of stipulation ‘Let z be thus and so’ is usually 
best construed as amounting to a generality prefix and an ‘if’- 
clause. Ordinarily the stipulation prefaces some passage of mathe- 
matical reasoning; and this whole combination can be treated as a 
generalized conditional sentence beginning ‘Every number z is 
such that if z is thus and so then’—and continuing with the main 
body of the passage of reasoning in question. 

The use of ‘z’ as an unknown in mathematical problems comes 


This paper was read by invitation to the American Philosophical Society in 
April 1960. It was promptly published in the Proceedings of the Society and is 
reprinted here by permission. An abstract of it appeared also in the Journal of 
Symbolic Logic for 1959, under the title ‘Eliminating variables without apply- 
ing functions to functions.”’ In that same year, as I afterward learned, a con- 
struction by Bernays to the same purpose appeared (‘‘Uber eine natiirliche 
Erweiterung des Relationenkalkiils’’). I think my scheme more elegant than 
his, but his paper goes beyond mine in that it gives axioms. 
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to the same thing. Such a problem starts out with some initial 
condition on x. Solving such a problem consists in finding an equa- 
tion, ‘s = ... ’, worthy of standing as the ‘then’-clause of a 
generalized conditional sentence whose ‘if’ clause states the given 
initial condition. 

Mathematicians often introduce further letters in the role of 
unspecified constants or so-called parameters, in explicit contrast 
to the so-called variables such as ‘xz’ and ‘y’. Logically these pa- 
rameters can be looked upon still as variables, and contrasted with 
‘7’ and ‘y’ merely in respect of how much text the general sentences 
or existence sentences take in. A typical page involving ‘a’ as 
a so-called parameter, and ‘x’ as a variable explicitly so-called, 
might be analyzed in the following fashion. The whole is governed 
by the implicit generality prefix ‘every number a is such that’. 
Then one or more briefer subsidiary clauses are governed by more 
transitory prefixes ‘every number ~ is such that’ or ‘some number x 
is such that’. Typical talk of parameters can be construed along 
this line without essential difficulty. 

Variables, of course, lend themselves to discourse not only of 
numbers but of objects of any sort. The non-numerical prefixes 
‘every set x is such that’, ‘every person z is such that’, ‘some coun- 
try x is such that’, have equal rights with the prefixes that talk of 
numbers. 

Nor are variables necessarily tied up with generality prefixes or 
existence prefixes at all. Basically the variable is best seen as an 
abstractive pronoun: a device for marking positions in a sentence, 
with a view to abstracting the rest of the sentence as predicate. 
Thus consider the existence statements ‘Some number «# is such 
that x is prime’ and ‘Some number ~ is such that xz? = 32’. The 
variable is conveniently dropped from the first: we may better say 
simply ‘Some number is prime’, because in ‘x is prime’ the predicate 
‘is prime’ is already nicely segregated for separate use. The variable 
can be eliminated also from the second example, but less conven- 
iently: we could say ‘Some number gives the same result when 
cubed as when trebled’, thus torturing the desired complex predi- 
cate out of ‘x? = 32’ with a modicum of verbal ingenuity. In more 
complex examples, finally, use of ‘x’ is the only easy way of ab- 
stracting the jagged sort of predicate which we are trying to say 
that some number fulfills. Where the variable pays off is as a de- 
vice for segregating or abstracting a desired predicate by exhibiting 
the predicate sentencewise with the variable for blanks. 
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The variable is invaluable still as abstractive pronoun in places 
where generality and existence are not the point. Thus consider 
singular descriptions, as logicians call them: phrases beginning 
with the singular ‘the’. We say ‘the square of 2’, ‘the author of 
Waverley’, without variables; but in ‘the number x such that’, 
followed by some complex condition on ‘x’, we need the variable as 
abstractive pronoun just as urgently as in the corresponding 
existence statement. 

As a point now of theory and not of practical convenience, it can 
be interesting to inquire whether the variable is in principle dis- 
pensable. We were able to avoid the variable as abstractive pronoun 
in the case of ‘x? = 3x’ by torturing ‘x? = 32’ into ‘x gives the 
same result when cubed as when trebled’. Once we had coaxed the 
dummy letter ‘x’ thus into suitable position and segregated the 
rest, we ceased to need the ‘zx’. In this example the coaxing de- 
pended on such auxiliary words as ‘gives’, ‘result’, and ‘when’, 
along with participial endings of verbs. Now my question is whether 
a general, finite battery of such auxiliary operators can be as- 
sembled that will enable us always to coax variables thus into posi- 
tions where we can dispense with them. 

The answer is affirmative, as I shall show. The interest in carry- 
ing out the elimination is that the device of the variable thereby 
receives, in a sense, its full and explicit analysis. There is no thought 
of denying ourselves the continuing convenience of variables in 
practice. 

I shall need henceforward to talk continually of predicates and 
predication in the following regimented way. An n-place predicate 
is a sign that attaches to a string of n subjects to make a sentence; 
and a sentence so formed is called a predication. Thus we may have 
a three-place predicate ‘F’ of distance comparison, where the predi- 
cation ‘Fxyz’ means that z is farther from y than from z. We may 
have a two-place predicate ‘B’ of biting, where the predication 
‘Bay’ means that x bites y. We may have a one-place predicate ‘D’ 
of doghood, where ‘Dx’ means that x is a dog. A no-place predicate, 
if we may force our terminology a bit, would be a sentence as it 
stands. 

Given the two-place predicate ‘B’, which is the transitive verb 
‘bites’, let us now contrast two styles in which we might say that x 
bites something. The one style uses an existence prefix; the other 
style uses a certain operator on predicates. The one style is familiar, 
and consists in two steps: first we form a predication ‘Bxy’ and 
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then we apply to it the existence prefix ‘Something y is such that’. 
The other style is opposite in order: first we make a new predicate, 
a one-place predicate meaning ‘bites something’, and then we use 
it and ‘a’ to form a predication ‘x bites something’. For this style 
we need an operator which can be applied to a two-place predicate 
or transitive verb ‘B’, ‘bites’, to produce a one-place predicate 
‘bites something’. Let us call this operator that of derelativization 
and write it ‘Der’. Thus ‘Der B’ is the one-place predicate or in- 
transitive verb of biting, or biting something, and the predication 
‘(Der B) x’ means that x bites something. 

I remarked that the essential utility of variables is that they 
mark positions. This point becomes vivid when we contrast the 
derelativization operator with the existence prefix, which used a 
variable ‘y’. The two devices are alike in enabling us to say that z 
bites something. But the existence prefix with its variable has the 
advantage of enabling us to say also such further things as that 


Something zx is such that something y is such that Bry, 
Something y is such that Byz, 
Something y is such that Byy, 


whereas our derelativization operator only takes care of the case 
‘Something y is such that Bry’. To make the derelativization opera- 
tor suffice in lieu of existence prefixes and variables, what are 
needed are certain further operators capable of coaxing a variable 
into the right position. Also an extension of the derelativization 
operator itself is needed; let me begin with that. 

So far I have explained derelativization as applying to a two- 
place predicate ‘B’, ‘bites’, to produce a one-place predicate ‘Der 
B’, ‘bites something’. Let us now explain it as applying similarly to 
a one-place predicate to produce a no-place predicate, or sentence, 
which simply affirms existence: ‘Der D’ means simply that there 
are dogs. Then, since ‘(Der B)z’ means that x bites something, 
‘Der Der B’ means that something bites something. This disposes 
of our example ‘Something z is such that something y is such that 
Bry’. 

With an eye now to the next of the above examples, ‘Something 
y is such that Byx’, we add an operator of znversion. This operator 
may be described as turning a transitive verb or two-place predi- 
cate from active to passive: ‘(Inv B)zy’ means that Byz. Thus 
equipped, we can rectify ‘Something y is such that Byz’ to read 
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‘Something y is such that (Inv B)zy’, whereupon we can bring de- 
relativization to bear; the whole gets translated as ‘(Der Inv B)z’, 
devoid of the existence prefix and its ‘y’. 

Our further example ‘Something y is such that Byy’ prompts the 
adoption of yet a third operator, that of reflection. It turns the two- 
place predicate ‘B’, ‘bites’, into the one-place predicate ‘Ref B’, 
‘bites self’. Thus ‘(Ref B)y’ means ‘Byy’. Then instead of ‘Some- 
thing y is such that Byy’ we can write ‘Something y is such that 
(Ref B)y’ and hence simply ‘Der Ref B’. 

These simple examples already illustrate the essential trick of 
the general elimination of variables. Let us now generalize. 

Our three operators—‘Der’, ‘Inv’, ‘Ref’—need to be generalized 
for application to predicates of more than two places. I shall 
generalize them in the least imaginative way, simply supplying 
the extra places as inert background. Even so, there are in the case 
of inversion two such generalizations both of which wil! be wanted. 
Altogether, then, our four operators on predicates may be de- 
scribed succinctly as follows, where ‘P’ represents any n-place predi- 
cate: 


Derelativization: (Der P)x, . . . 2n-1 if and only if there is some- 
thing x, such that Px, . . . 2p. 

Major inverston: (Inv P)z ... tn if and only if Pram... 
Ly —le 

Minor inversion: (inv P)x...2, if and only if Pm... 
TnL ntn—-1- 


efeciion: (Rei P)a, . . . seeyif andvonly if Pz. . . Ate. 


We saw that ‘Something y is such that Bry’ and the three kindred 
examples could be rendered respectively as ‘(Der B)x’, ‘Der Der B’, 
‘(Der Inv B)z’, and ‘Der Ref B’. Let us now try our four generalized 
operators on the more serious example: 


Something x is such that something y is such that Pyzyz. 


By transforming the part ‘Pyzxyz’ first into ‘Gnv P)yxzy’, thence 
into ‘(Inv inv P)zzyy’, and thence into ‘(Ref Inv inv P)rzy’, we 
reduce the whole sentence to: 


Something z is such that (Der Ref Inv inv P)zz, 


which reduces in turn to: 
Der Ref Der Ref Inv inv P. 
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More generally, we can claim this of our four operators: they 
enable us to get rid of existence prefixes and their variables when- 
ever, as in the above example, there are only the existence prefixes 
and one predication. The reasoning is as follows. It is easily seen 
that major and minor inversion suffice to permute any number of 
subjects into any desired order; and then reflection suffices to re- 
solve repetitions, when they are permuted to terminal position. 
Finally derelativization takes care of each existence prefix and its 
variable in terminal position. 

We have still to worry about existence prefixes governing sen- 
tences which are compounded of predications, e.g., in the fashion 
‘Bry and not Fw2z’. To handle such cases, we need these two 
further operators on predicates: 


Negation: (Neg P)x, . . . x, 4 and only if not (Pa, . . ear 
Cartesian multiplication: (P X Q)a1... Umi... Yn if and 
Gmiy tt Tear... Em ANON . . . Sew 


Using these, we can express our example ‘Bry and not Fwxz’ as 
a single predication ‘(B X Neg F)xywzz’. (In reading this we have 
to know, of course, that ‘B’ is two-place and ‘F’ is three-place.) 

Our operators on predicates are now six. They enable us to get 
rid of an existence prefix and its associated variable when what 
the prefix governs is constructed by ‘not’ and ‘and’, as complexly 
as you please, from any number of predications. 

I can put this more strongly. Suppose we have a language of 
the following form. Its simple sentences are predications, formed 
of predicates and strings of variables. Its compound sentences are 
built up of such predications by repeated use of just three devices: 
‘not’, ‘and’, and existence prefixes. These prefixes are ‘something x 
is such that’, ‘something y is such that’, etc., with no restriction of 
the objects to special categories such as numbers or persons; the 
universe may be conceived widely or narrowly, but it is to be the 
same for every existence prefix. A language thus simply consti- 
tuted I shall call standard. Standard languages differ from one 
another only in their stock of predicates and in how the universe is 
chosen. Now we can say this of our six operators on predicates: 
these, if added to a standard language, enable us to rid the lan- 
guage of existence prefixes and variables altogether. Given any 
sentence of a standard language, we go to work on an innermost 
existence prefix: one whose governed clause contains no existence 
prefixes. Our six operators on predicates enable us to eliminate it 
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and its variable. Then we deal with any surviving innermost exist- 
ence prefix, and, working thus outward, eventually make a clean 
sweep of all existence prefixes and their variables. 

Thus consider the sentence ‘Some men read no books’. In stand- 
ard language it is: 


Something z is such that (Mz and not something y is such that 
By and Rzy), 


for obvious interpretations of ‘M/’, ‘B’, and ‘R’. Now the part ‘By 
and Rzry’ can be transformed successively thus: 


(BX R)yzy, [Inv(B X F)|xyy, [Ref Inv (B X R)]zy. 
The whole sentence ‘Some men read no books’ then becomes: 

Something z is such that {Mz and not [Der Ref Inv(B X R)]z}, 
which can be further transformed into: 

Something x is such that [M7 X Neg Der Ref Inv (B X R)]zz, 


and finally into: 
Der Ref[M X Neg Der Ref Inv(B X R)}. 


This, it will be said, is an illustration in miniature of how vari- 
ables might be eliminated from serious theories. Actually it is more: 
it is already a solution of the general case.! For there is evidence 
that what I have called the standard form of language is, despite 
its apparent poverty, an adequate medium for scientific theories 
generally. Some illustrative translations will clarify this point. 

Existence prefixes suffice to the exclusion of generality prefixes. 
Thus consider the generality prefix ‘every number is such that’. 
Using ‘not’ and an existence prefix, we can paraphrase it as ‘not 
some number « is such that not’. 


1 The first general elimination of variables was due to Schénfinkel, 1924. His 
operators operate on themselves and one another, whereas our six operate only 
on the original predicates and on the predicates thence derived by the oper- 
ators. His presuppose an abstract universe equivalent to that of higher set 
theory, whereas ours make no ontological demands, being even retranslatable 
into ‘not’, ‘and’, and existence prefixes. An apparatus equivalent in scope to 
ours is Tarski’s cylindrical algebra, when the number of its dimensions is taken 
as infinite but locally finite; see Henkin, “The representation theorem for 
cylindrical algebras.’’ But here again there is a radical difference in approach. 
In a way our operators are reminiscent also of the axioms of class existence in 
von Neumann, 1925, despite dissimilarity of purpose. See also my ‘“Toward a 
calculus of concepts.’’ Along Schénfinkel’s line much research has meanwhile 
been done, mainly by Curry. 
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Existence prefixes suffice to the exclusion also of the prefixes of 
singular description. Thus consider the singular description ‘the 
number x such that x + x = x’. Whenever it is used, it is used in 
one or another sentence that says something further about the 
described number; e.g., that it is less than 1. Now instead of saying 
that the number xz, such that x + x = g, is less than 1, we can 
resort to an existence prefix and say merely that some number z is 
such that x +a = 4x and x < 1. If with an eye to the ‘the’ of 
singular description we want also to affirm uniqueness—that only 
one number 2 is such that x + x = x—we can add a further sen- 
tence to that effect. It too can ultimately be formulated without 
using variables otherwise than in connection with existence pre- 
fixes.? 

I urged earlier that the variable is best understood as an ab- 
stractive pronoun. Still one finds, as in these examples, that all 
contexts which call for variables can be warped around into ones 
in which variables are used solely with existence prefixes. Alterna- 
tive paraphrases are available too, herding the variables into other 
types of construction; but we can rest with the existence prefixes. 

We can do better: we can adhere to the unrestricted form of 
existence prefix ‘something x is such that’, as against the restricted 
form ‘some number x is such that’ and the like. For, ‘some number 
x is such that’ can be paraphrased as ‘something z is such that z is 
a number and’. Such, indeed, was our treatment of men and books 
in transforming ‘Some men read no books’. 

This much suffices perhaps to illustrate that the standard form 
of language is adequate so far as the role of variables is concerned. 
Variables can be seen as adjuncts purely of existence prefixes, and 
unrestricted ones at that. 

Another economy of standard language was that apart from 
existence prefixes it recognizes only ‘not’ and ‘and’ as means of 
building sentences from sentences. Ways have long been familiar 
whereby ‘not’ and ‘and’ can be made to do the work of various 
further sentence connectives: “p or q’ can be rendered ‘not (not p 
and not q)’, and ‘if p then q’ can be rendered ‘not (p and not q)’. 

These and further reductions to what I have called the standard 
form are familiar to logicians.’ All branches of classical mathemat- 

2 Not some number z is such that [z +2 = x and some number y is such 
that (@ ¥ yandy +y = y)]. 

3The modern logic of quantifiers—i.e., generality prefixes and existence 
prefixes—dates from Frege, 1879. So do the various reductions noted in the 


past few paragraphs, except that the eliminability of singular descriptions was 
noted rather by Russell, 1905. 
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ics can be put into standard form, and so can all other branches of 
theory that would be at all generally regarded as having attained 
to explicit scientific formulation. This is abundantly borne out by 
literature on the logical regimentation of mathematics and other 
scientific discourse. 

Singular terms seem a mainstay of language. We continually use 
proper names, and also complex singular terms such as ‘the author 
of Waverley’ and ‘x + y’. It is worth noticing, then, that our stand- 
ard form has none of these—no singular terms except the simple 
variables themselves. It has long been known that by suitable choice 
of predicates we can dispense with singular terms other than vari- 
ables. The main step in that argument is the elimination of singular 
descriptions that was illustrated above. 

And now our new reduction dispenses even with the variables. 
There cease to be singular terms at all; there remain only the predi- 
cates themselves and our six fixed operators upon them. Each sen- 
tence fares substantially as our example ‘Some men read no books’ 
was seen to fare. 

It will be said that there may still survive, in examples other than 
that one, the sentential operators ‘not’ and ‘and’. But we can dis- 
claim these, for they are best viewed as merely those cases of ‘Neg’ 
and ‘X’ where the predicates to which ‘Neg’ and ‘X’ are applied 
happen to be no-place predicates, i.e., sentences. 

We end up with a universal algebra purely of predicates, com- 
prising just our six operators and any arbitrary predicates as gen- 
erators for them to operate on. This is a general logical notation. 
It is devoid of variables, yet theoretically adequate as a framework 
for theories generally, mathematical and otherwise. To fix it as a 
notation for any specific subject matter we merely supply the ap- 
propriate vocabulary of specific predicates, leaving the outward 
framework unchanged; and that framework consists of our six 
operators, nothing more. 
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